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(54) Altered thermostable DNA polymerases for sequencing 

(57) Modified thermostable DNA polymerases hav- 
ing enhanced efficiency for incorporating unconven- 
tional nucleotides such as those labeled witii 
fluorescein family dyes are provided. Such modified 
thernrx)Stable DNA polymerases can be prepared by 
methods of the recombinant DNA technology. Recom- 
binant thermostable DNA polymerases prepared in 
accordance with the present invention are advanta- 
geous in many in vitro DNA synthesis applications, such 
as DNA sequencing, synthesis of labeled DNA and the 
production of labeled primer extension products. The pix\Q\^ 
sM polymerase enzymes are particularly useful in 
chain tennination nucleic add sequencing methods. 
Nucleic adds encoding the recombinant themiostable 
DNA polymerases of the present invention, as well as 
vectors and host cells comprising same are also pro- 
vided. Kits comprising the recombinant thermostable 
DNA polymerases of the present invention are provided 
toa The polymerases and methods of the present 
invention provUe significant cost and efficiency advan- 
tages for DNA sequendng. 
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Description 

Held of the Invention 

5 [0001 ] The present invention relates to thermostalDle DNA polymerases which have enhanced efficiency for incorpo- 
rating nucleoside triphosphates labeled with fluorescein faniily dyes. The present invention provides means for isolating 
and producing such altered polymerases. The enzymes of the invention are useful for many applications in molecular 
biology and are particularly advantageous for nucleic add sequencing. 

10 Background of the Invention 

[0002] Incorporation of nucleoside triphosphates (dNTPs) labeled with fluorescent dyes is important for many in vitro 
DNA synthesis applications. For example, dye-terminator DNA sequencing reactions require the incorporation of fluo- 
rescent dideoxynudeotide analogues for termination and labeling. In addition, in vitro synthesis of labeled products 
15 may involve incorporation of fluorescent nudeotides or nucleotide analogues. For example, f luorescentiy labeled DNA 
has been used in hybridization assays using microarrays of imnx>bilized probes (Cronin et al., 1996, Human Mutation 
7:244). 

10003] To assure fidelity of DNA replication. DNA polymerases have a very strong bias for incorporation of their nor- 
mal substrates, referred to herein as conventional deoxynudeoside triphosphates (dNTPs), and against incorporation 

20 of unconventional dNTPs induding dNTPs and dNTP analogues labeled with fluorescent dyes. In the cell, this property 
attenuates the incorporation of abnormal bases such as dUTP in a growing DNA strand. In vitro, this characteristic is 
particularly evident where botii conventional and unconventional fluorescentiy-labeled nucleoside triphosphates are 
present, such as in DNA sequendng reactions using a version of the dideoxy chain termination method that utilizes dye- 
terminators (Lee et a/.. 1992, Nuc. Acids. Res. 20:2471 which is incorporated herein by reference). 

25 [0004] Commercially available DNA cycle sequendng kits for dye-terminator methods use chain terminator ddNTPs 
labeled with fluorescent dyes of the rhodamine family. However, rhodamine dyes are zwitterionic in charge and nudeo- 
side triphosphates labeled with these dyes migrate anomalously in the electrophoretic gels used to separate tiie 
sequendng products for detection. This property of rhodamine family dyes necessitates making modifications in tiie 
standard sequencing protocol which include the use of dITP and an additional processing step before electi'ophoresis. 

30 [0005] In contrast, negatively charged fluorescent dyes such as fluorescein family dyes allow 1) better separation 
between the labeled nudeoside triphosphates and labeled primer extension products, and 2) better electrophoretic 
migration of tiie labeled sequendng products tiian neuUal or positively charged fluorescent dyes. Thus, the use of flu- 
orescein family dyes avoids the need for additional processing steps required with the use of rhodamine family dyes. 
However, available dyes of the fluorescein family are not ideal for use in current commercially available DNA cycle 

35 sequendng formats because ddNTPs labeled with these dyes are not effidently incorporated into sequencing products 
using these formats. Consequently, there is a need for commerdally available thermostable DNA polymerases that can 
effidentiy incorporate both conventional and f luorescein-labeled nudeotides. The present invention serves to meet that 
need. Further, an unexpected property of the mutant enzymes of this invention is the increased rate of primer extension 
relative to the corresponding wild-type enzyme. Another unexpected prq^erty is the increased uniformity of incorpora- 

40 tion of the various terminator nudeotides in automated DNA sequence analysis. 

Summarv of the Invention 

[0006] The present invention provides template-dependent thermostable DNA polymerase enzymes having reduced 
45 discrimination against incorporation of nudeotides labeled with fluorescein family dyes compared to previously charac- 
terized enzymes. These enzymes incorporate nucleotides, including deoxynucleotides (dNTPs) and base analogues 
such as dideoxynudeotides (ddNTPs), that are labeled with fluorescein family dyes more effidently than conventional 
thermostable enzymes. Genes encoding these enzymes are also provided by the present invention, as are recombinant 
expression vectors for providing large amounts of purified enzymes. 
50 (0007] By the present invention, a region of criticality within thermostable DNA polymerases is identified which affects 
the polymerase's ability to incorporate nudeotides labeled with fluorescein family dyes, while retaining the ability to 
incorporate faithfully natural nudeotides. This region of criticaKty. or Critical Motif, can be introduced into genes for ther- 
mostable DNA polymerases by recombinant DNA methods such as sHe-specific nwtagenesis to provide the advan- 
tages of the invention. 

55 [0008] Thus, in one aspect, the invention provides recombinant thernrwstable DNA polymerase enzymes which are 
characterized in that the enzymes have been mutated to produce the Critical l^otif and have reduced discrimination 
against incorporation of nucleotides labeled with fluorescein family dyes, in comparison to the corresponding wild-type 
enzyme. 
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[0009] In this aspect, the invention provides recomtsinant thernrx>stat)le DNA polymerase enzymes which are charac- 
terized in that a) In its native form said polymerase conrprises the amino acid sequence (given in one-letter code) 
LSXXLX(V/I)PXXE (SEQ ID NO: 1), where X is any amino add; b) the X at position 4 in said sequence is mutated in 
comparison to said native sequence, except that X is not mutated to E; and c) said thermostable DNA polymerase has 
5 reduced discrimination against incorporation of nucleotides labeled with fluorescein family dyes in comparison to the 
native form of said enzyme. In the three-letter code, this amino acid sequence is represented as LeuSerXaaXaaLeuX- 
aaXaaProXaaXaaGlu (SEQ ID NO: 1). whereby "Xaa" at positions 3. 4. 6. 9, and 10 of this sequence are any amino 
acid residue, and **Xaa" at position 7 of this sequence ts Val or lie. 

[0010] In another embodiment, the recombinant thermostable DNA polymerases are characterized in that a) the 

10 native form of the polymerase comprises the amino add sequence LS(Q/G)XL(S/A)IPYEE (SEQ ID NO: 2), where X is 
any amino add; b) the X at position 4 in said sequence is mutated in comparison to said native sequence, except that 
X is not mutated to E; and c) said thermostable DNA polymerase has reduced discrimination against incorporation of 
nudeotides labeled with fluorescein family dyes in comparison to the native form of said enzyme. In the three-letter 
code, this amino acid sequence is represented as LeuSerXaaXaaLeuXaalleProTyKSluGlu (SEQ ID NO: 2). whereby 

15 "Xaa" at position 3 is Gin or Gly. "Xaa" at position 4 is any amino add, and "Xaa" at posrtion 6 is Ser or Ala. In a preferred 
embodiment, the amino acid sequence is LSQXLAIPYEE (SEQ ID N0:3). where X is any amino acid. In the three-letter 
code, this amino add sequence is represented as LeuSerGlnXaaLeuAlalleProTyrGluGlu (SEQ ID N0:3), whereby 
"Xaa" at position 4 Is any amino acid. In a nnore preferred embodiment, the "Xaa" at position 4 is Lys. 
[001 1] In yet another embodiment, the recombinant thermostable DNA polymerases are characterized in that a) the 

20 native form of the polymerase comprises the amino add sequence LSVXLG(V/I)PVKE (SEQ ID NO: 4); b) the X at posi- 
tion 4 in said sequence is mutated in comparison to said native sequence, except that X is not mutated to E; and c) said 
thermostable DNA polymerase has reduced disaimination against incorporation of nucleotides labeled with fluorescein 
family dyes in conparison to the native form of said enzyme. In the three-letter code, this amino acid sequence is rep- 
resented as LeuSerValXaaLeuGlyXaaProValLysGu (SEQ ID NO: 4). whereby "Xaa" at posrtion 4 is any amino acid and 

25 "Xaa" at position 7 is Val or lie. In a preferred enixxJIment the amino add sequence is LSVXLGVPVKE (SEQ ID NO: 
5) where X at posrtion 4 is any amino acid. In the three-letter code, this amino acid sequence is represented as LeuS- 
erValXaaLeuGlyValProValLysGIu (SEQ ID NO: 5). whereby "Xaa" at position 4 is any amino add. In a more preferred 
ennbodiment, the "Xaa" at position 4 is Arg. In another preferred enixxllment, the amino add sequence is LSVXL- 
GIPVKE (SEQ ID NO: 6) where X at position 4 is any amino add. In the three-letter code, this amino add sequence is 

30 represented as LeuSerValXaaLeuGlylleProValLysGlu (SEQ ID NO: 6), whereby "Xaa" at position 4 is any amino acid. 
In a more preferred embodiment, the "Xaa" at position 4 is Arg. 

[0012] In another aspect of this invention, the particular region of CTiticallty of this invention can be combined with 
motifs in other regions of the polymerase gene that are known to provide thernriostable DNA polymerases with reduced 
discrimination against incorporation of unconventional nudeotides such as rNTPs and ddNTPs. As exenplif led herein, 

35 a recombinant Thermus aquaticus (Taq) DNA polymerase enzyme containing two mutations was constructed. The first 
nuitation was an E to K nruitation in the X residue at position 4 of the critical motif of this invention. The second mutation 
was a mutation allowing more effident incorporation of ddNTPs known as the F667Y mutation. This mutation is a phe- 
nylalanine to tyrosine nurtatlon at position 667 of Taq DNA polymerase (described in US. Patent No 5,61 4.365 and U.S. 
Serial No 8/448.223 and herein incorporated by reference). When used in a sequencing reation with fluorescein dye 

40 family-labeled ddNTPs. the E681 K F667Y double mutant enzyme was found to produce a readable sequendng ladder. 
Thus, in one embodiment, a motif conferring reduced discrimination toward dideoxynucleotides is combined with the 
critical motif of this invention to provide an enzyme having an increased efficiency of incorporation of both labeled and 
unlabeled ddNTPs. 

[0013] In addition, the E681K F667Y mutant enzyme was unexpectedly found to exhibit a significantly inaeased 
45 extension rate relative to an enzyme with the F667Y mutation alone. Thus, in another embodiment of the invention, 
introduction of the aitical motif into a thermostable DNA polymerase enzyme, alone or in combination with other rrwtifs, 
produces enzymes having an increased extension rale. The double mutant enzyme was also unexpectedly found to 
produce nx)re uniform peak heights in dye-terminator dideoxy sequendng using rhodamine-labeled terminators. Thus, 
in yet another embodiment, introduction of the critical motif into a thermostable DNA polymerase enzyme produces 
so enzymes displaying mae uniform peak heights in DNA sequendng methods using rhodamine dye family labeled ter- 
minators. 

[0014] In another embodiment, a mutation allowing more effident incorporation of rNTPs, such as the glutamic add 
to glydne mutation at posrtion 615 of Taq DNA polymerase, or E615G mutation (described in European Patent Appli- 
cation. Publication No. EP-A-823 479, herein incorporated by reference), is combined with the critical motif of this inven- 
55 tion to provide an enzyme having an inaeased efficiency of incorporation of ribonudeotides labeled with fluorescein 
family dyes. 

[001 5] In another aspect of this invention, genes encoding the polymerases of this invention are also provided. Spe- 
cKically. genes encoding recombinant thermostable polymerases conr^rising the critical motif of this Invention are pro- 
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vided. Also included in this aspect are genes encoding combinations of two or more mutations that include mutations 
producing the critical motif of this invention. 

[001 6] In yet another aspect, the invention also provides improved methods of ON A sequencing that allow the use of 
lower concentrations of fluorescein dye family-labeled ddNTPs. thereby reducing the cost of performing the reactions. 

5 The improved methods of the invention also allow the use of lower ratios of fluorescein dye family-labeled ddNTPs to 
dNTPs. Use of these methods results in numerous advantages, including more efficient polymerization, lower concen- 
trations of tenrplate nucleic acid being required, and a decreased likelihood of introducing inhibitors into the reaction 
mix. These advantages also facilitate the sequencing of long templates. The invention also provides improved methods 
of sequencing wherein sequencing reactions can be loaded directly onto sequencing gels for subsequent electrophore- 

10 sis without intermediate purification. 

[0017] Thus, in one embodiment of the invention, the invention provides improved methods for determining the 
sequence of a target nucleic acid using a recombinant enzyme which has a) a mutation at position 4 which produces 
the critical motif of this invention and b) has reduced dtsalmlnation against incorporation of nucleotides labeled with flu- 
orescein family dyes in comparison with the con'esponding wild-type enzyme. Also within the scope of this invention are 

IS innproved sequencing methods using thermostable DNA polymerase enzymes derived from thermophilic species, 
where the enzymes contain naturally occurring sequence variations that produce the critical nx)tif of this invention. 
These native enzymes can also provide reduced discrimination against incorporation of unconventional nucleotides. In 
this embodiment, the invention provides improved methods of sequencing using a native thermostable DNA polymer- 
ase a) having the critical motif of this invention wherein the amino acid in position 4 is not Glu and b) having reduced 

20 discrimination against incorporation of nucleotides labeled with fluorescein family dyes. 

[001 8] Also within the scope of this invention are improved methods of producing DNA labeled with fluorescein family 
dyes. The enzymes of the invention efficiently incorporate f luoresceln-labeled dNTPs in a polymerase chain reaction 
method, producing anrplif led products that are labeled at various sites with fluorescein family dyes. Thus, in one embod- 
iment, an innproved method of labeling DNA comprises a) providing a reaction mixture comprising dNTPs labeled with 

26 fluorescein family dyes and an enzyme of the invention and b) performing a nucleic acid amplification reaction. 

(001 9] The enzymes of the invention, and genes encoding these enzymes, provide additional aspects of the invention 
which are kits for DNA sequencing that comprise a recombinant enzyme of the invention and may additionally Include 
a negatively charged fluorescent terminator compound. Other kits for DNA sequencing comprise a) a negatively 
charged fluorescent terminator compound and b) a native enzyme of tiie invention. 

30 [0020] The invention also provides kits for producing labeled DNA which conrtprise a recombinant enzyme of the 
invention. Other kits for producing labeled DNA comprise a) a negatively charged fluorescent nucleoside triphosphate 
compound and b) a native enzyme of the invention. 

Briet Description of the Drawing 

3$ 

[0021] 

Figure 1 is a schematic representation of the Taq DNA polymerase gene. Restriction sites are indicated that relate 
to Example 1 and the description of methods for preparing additional mutants and expression vectors provided 
40 herein. 

Detailed Description of the Invention 

[0022] To facilitate understanding of tiie invention, a number of terms are defined below. 

45 [0023] The term "gene" refers to a DNA sequence that comprises control and coding sequences necessary for the 
production of a recoverable bioactive polypeptide a precursor. The polypeptide can be encoded by a full-length gene 
sequerKe or by any portion of the coding sequence so long as the enzymatic activity Is retained. 
[0024] The term "native* refers to a gene or gene product which is isolated from a naturally occurring source. This 
term also refers to a recombinant form of the native protein produced by molecular biological techniques which has an 

60 amino add sequence identical to that of the native form. 

[0025] The term "mutant" refers to a gene that has been altered In its nucleic acid sequence or a gene product which 
has been altered in its amino acid sequence, resulting in a gene product which may have altered functional properties 
when compared to the native or wild-type gene or gene product Such alterations include point mutations, deletions and 
insertions. 

55 [0026] The term "host cell(s)" refers to botii single-cellular prokaryote and eukaryote organisms such as bacteria, 
yeast and actinomycetes and single cells from higher order plants or animals when being grown in cell culture. 
[0027] The term "expression system" refers to DNA sequences containing a desired coding sequence and control 
sequences in operable linkage, so that host cells transformed with these sequences are capable of producing the 



4 



EP 0 902 035 A2 



encoded proteins. To effect transformation, the expression system may be included on a vector; however, the relevant 
ONA nrtay also be integrated into the host chronrK)some. 

[0028] The term "oligonucleotide" as used herein is defined as a nrK)lecule comprised of two or mae deoxyribonude* 
otides or ribonucleotides, preferably more than three, and usually more than ten. The exact size of an oligonucleotide 

5 will depend on many factors, including the ultimate function or use of the oligonucleotide. 

[0029] Otigonudeottdes can be prepared by any suitable method, including, for example, cloning and restriction of 
appropriate sequences and direct chemical synthesis by a method such as the phosphotriester method of Narang et 
al., 1979. Meth. Enzymol. 68:90-99: the phosphodi ester method of Brown etal.. 1979. Meth. Enzvmol. 68:109-151: the 
diethylphosphaamidite method of Beaucage et aL. 1981 . Tetrahedron LfilL 22: 1859-1862; the triester method of Mat- 

10 teucd et aL, 1981 J, AOL Chem. Soc. 103:3185-3191 or automated synthesis methods; and the solid support method 
of U.S. Patent No. 4,458.066. which publications are each incorporated herein by reference. 
[0030] TTie term "primer" as used herein refers to an oligonucleotide, whether natural or synthetic, which is capable 
of acting as a point of initiation of synthesis when placed under conditions in which primer extension is initiated. A 
primer is preferably a single-stranded oligodeoxyribonudeotide. The appropriate length of a primer depends on the 

15 intended use of tiie primer but typically ranges from 15 to 35 nucleotides. Short primer molecules generally require 
cooler tenperatures to form suffidentiy stable hybrid complexes with the template. A primer need not reflect tiie exact 
sequence of the template but must be suff icientiy complementary to hybridize with a template for primer elongation to 
occur. 

[0031] A primer can be labeled, if desired, by incorpaating a label detectat^le by spectroscopic, photochemical, bio- 
20 chemical, immunochemical, or chemical means. For example, useful labels include ^P, fluorescent dyes, electron- 
dense reagents, enzymes (as commonly used in ELiSA assays), biotin. or haptens and proteins for which antisera or 
monoclonal antitxxiies are available. 

[0032] The term "thermostable polymerase," refers to an enzyme which is stable to heat, is heat resistant and retains 
sufficient activity to effect subsequent primer extension reactions and does not become irreversibly denatured (inacti- 

25 vated) when subjected to the elevated temperatures for tiie time necessary to effect denaturation of double-stranded 
nudeic adds. The heating conditions necessary for nucleic acid denaturation are well known in the art and are exem- 
plified in U.S. Patent Nos. 4.683.202 and 4,683.195. which are incorporated herein by reference. As used herein, a ther- 
mostable polymerase is suitat)le for use in a temperature cycling reaction such as the polymerase chain reaction 
("PGR"). Irreversible denaturation for purposes herein refers to permanent and complete loss of enzymatic activity. For 

30 a thermostable polymerase, enzymatic activity refers to the catalysis of the combination of the nucleotides in tiie proper 
manner to form primer extension products that are complementary to a template nucleic acid strand. 
[0033] The term "conventional" or "natural" when referring to nucleic acid bases, nucleoside triphosphates, or nude- 
otides refers to those which occur naturally in the polynucleotide being described (i.e., for DNA these are dATP, dGTP, 
dCTP and dTTP). Additionally. dlTP. and 7Kjeaza-dGTP are frequentiy utilized in place of dGTP and 7-deaza-dATP can 

35 be utilized in place of dATP in in vitro DNA synthesis reactions, such as sequendng. Collectively these may be referred 
toasdNTPs. 

[0034] The term "unconventional" or "modified" when referring to a nudeic acid base, nudeoskle. or nucleotide 
indudes nrtodif ication, derivations, or analogues of conventional bases, nudeosides. or nucleotides that naturally occur 
in a particular polynucleotide. The deoxyribonudeotide form of uracil is an unconventional or modified base in ONA 

40 (dUMP), whereas, the rtoonudeotide form of uracil is a conventional base In RNA (UMP). As used herein, unconven- 
tional nucleotides include but are not limited to compounds used as terminators for nudeic acid sequendng. Terminator 
conrpounds indude but are not limited to those conpounds which have a 2\3* dideoxy structure and are refen-ed to as 
dideoxynudeoside triphosphates. The dideoxynudeoside triphosphates ddATP. ddTTP. ddCTP and ddOTP are referred 
to collectively as ddNTPs. CMher unconventional nudeotides indude phosphorothioate dNTPs ([a-S]dNTPs). 5*-a- 

45 borano-dNTPs. a-methyl-phosphonate dNTPs. and ribonudeoside triphosphates (rNTPs). Unconventional bases may 
be labeled with radioactive isotopes such as ^^p, ^^p^ fluorescent labels; chemiluminescent labels; biolumines- 
cent labels; hapten labels such as biotin; or enzyme labels such as streptavidin or avidin. Fluorescent labels may 
indude dyes that are negatively charged, such as dyes of tiie fluorescein family, or dyes that are neutral in charge, such 
as dyes of the rhodamine family, or dyes that are positively charged, such as dyes of the cyanine tamily. Dyes of the 

60 fluorescein feunily indude e.g., FAM, HEX. TET. JOE, NAN and ZOE. Dyes of the rhodannine tamily indude Texas Red. 
ROX, R1 10. R6G. and TAMRA. FAM. HEX. TET JOE. NAN, ZOE. BOX. R110. R6Q. and TAMRA are nrwrketed by Per- 
Wn-Elmer (Foster City, OA), and Texas Red is marketed by Molecular Probes. Dyes of the cyanine tamfly indude Cy2, 
Cy3, Cy5, and Cy7 and are marketed by Amersham (Amersham Place, Uttie Chalfort, Buckinghamshire. England). 
[0035] The term "DNA synthesis reaction" refers to methods of produdng copies of DNA induding but not limited to 

55 PGR, strand displacement amplification, transaiption mediated amplification, primer extension and reverse transaip- 
tion. 

[0036] In order to further fadlitate understanding of the invention, specific thermostable DNA polymerase eruymes 
and fluorescent dyes are referred to throughout the specification to exemplify the invention, and these references are 
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not intended to limit the scope of the invention. 

[0037] The present invention provides novel and improved compositions which are thermostat^e DNA polymerases. 
The enzymes of the invention include recombinant polymerases which nriore efficiently incorporate nucleoside triphos- 
phates labeled with fluorescein family dyes in comparison to the corresponding wild-type enzymes. The thermostat)le 

5 DNA polymerases of the invention are more suitable and desirable for use in processes such as DNA sequencing and 
in vitro synthesis of labeled products than prior art polymerases. Improved DNA sequencing methods of the invention 
include the use of these recombineint polymerases as well as the use of native enzymes which more efficiently incor- 
porate nucleoside triphosphates labeled with fluorescein family dyes than previously characterized enzymes. DNA 
sequences encoding these enzymes, and vectors for expressing the proteins are also provided. 

10 [0038] The thermostable DNA polymerases of the invention possess a region of criticality within the amino acid 
sequence of the polymerase activity domain of the enzyme. The critical region within the amino acid sequence of a ther- 
mostable DNA polymerase provided by the present invention is shown below using the conventional single-letter amino 
acid code (Lehninger. Biochemistry. New YorK New York, Worth Publishers Inc., 1970. page 67. which is incorporated 
herein by reference). 

15 SEQ ID NO: 7 LSXXLX(V/I)PXXE where the "X" at position 4 Indicates any amino acid except E. In the three-letter code 
for amino adds, this sequence is represented as LeuSerXaaXaaLeuXaaXaaProXaaXaaGlu (SEQ ID NO: 7) whereby 
"Xaa" at positions 3. 6. 9. and 10 is any amino acid, "Xaa" at position 4 of this sequence is any amino add but not a 
glutamic add residue (Glu) and "Xaa" at position 7 is Val or He. This region of criticality provides thermostable DNA 
polymerase enzymes characterized by the ability to effidently Incorporate nudeotides labeled with fluorescein family 

20 dyes. 

[0039] For example, in a derivative of the Thermus aquaticus (Tag) DNA polymerase gene which already contains a 
glycine to aspartic acid mutation at position 46 (G46D) and an F667Y mutation, a mutation of G to A in the first position 
of the codon for glutamic acid at residue 681 sequence of the full length Taq DNA polymerase sequence (corresponding 
to position 4 of the critical motif) results in an enzyme having the critical motif. This enzyme displays 1) an approxirr^tely 
2S a 2- to 10-fold Inaease in the efficiency of incorporation of nudeotides labeled with fluorescein family dyes with no 
impairment of the enzyme's ability to mediate PGR in the presence of conventional nucleotides and 2) a 3 to 4.3-fbld 
increase in the extension rate. In Taq DNA polymerase this particular mutation results in an amino acid change of E 
(glutamic add) to K (lysine). 

[0040] Although this particular amino acid change produced the critical motif and significantly alters the ability of the 
30 enzyme to incorporate unconventional nucleotides, it is expected that the specific change of E to K is not as critical to 
the invention as is the now identified position within the region of aiticality. Thus, in a preferred embodiment, the inven- 
tion provides recombinant thernriostabie DNA polymerase enzymes which are characterized in that a) in its native form 
said polymerase comprises the amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 1), where X is any amino add; 
b) the X at position 4 in said sequence is mutated in comparison to said native sequence, except that the X at position 
35 4 Is not mutated to E; and c) said thernfK)stable DNA polymerase has reduced discrimination against incorporation of 
nudeotides labeled with fluorescein family dyes in comparison to the native form of said enzyme. In a more prefenred 
ennbodiment, the X at position 4 is replaced by an amino acid having a positive charge, such as K. R or H. or by a polar 
amino add such as Q or N. In a most preferred embodiment, the X at position 4 is replaced by K. 
[0041] In another preferred embodiment of the invention, the of the invention is characterized in that the enzyme (a) 
40 has reduced dtsaimination against fluorescein dye family labeled nudeotides and (b) conrprlses the amino add 
sequence LS(QA3)XL(S^A)1PYEE where X is any amino acid (SEQ ID NO: 2). In three-letter code, this amino acid 
sequence is represented as follows: LeuSerXaaXaaLeuXaalleProTyrGluGlu, whereby "Xaa" at position 3 is Gin or Gly, 
^Xaa" at position 4 is any amino acid, and "Xaa" at position 6 is Ser or Ala. 

[0042] In a more preferred embodiment of the Invention, the enzyme having reduced disaimination against f lucres- 
45 cein dye family labeled nudeotides comprises the amino add sequence LSQXLAIPYEE where X is any amino acid 
(SEQ ID NO: 3). In the three-letter code, this amino add sequence is represented as LeuSerGInXaal^uAlalleProTyr- 
GuGu. whereby "Xaa" at position 4 is any amino acid. In a most preferred embodiment of the invention, the X is a K 
residue. 

[0043] In another prefen-ed embodiment of the invention, the the enzyme having reduced disaimination against f luo- 
so rescein dye family labeled nudeotides comprises the amino add sequence LSVXLQ(V/OPVKE where X is any amino 
add (SEQ ID NO: 4). tn the three-tetter code, this amino add sequence is represented as LeuSerValXaaLeuGlyXa- 
aProValLysGIu. whereby at position 4 is any antino acid and "Xaa" at position 7 is Val or He. 
[0044] In a nrwre preferred embodiment of the invention, the enzyme having reduced discrimination against fluores- 
cein dye family labeled nucleotides comprises the amino acid sequence LSVXLGVPVKE where X is any amino acid 
55 (SEQ ID NO: 5). In the three-letter code, this amino add sequence is represented as LeuSerValXaaLeuGlyValProVal- 
LysGlu. whereby *Xaa** at position 4 is any amino add. In a most preferred embodiment, the X is an R residue. 
[0045] In another more preferred embodiment the enzyme having reduced discrimination against f luaescein dye 
family labeled nudeotides comprises the amino acid sequence LSVXLGIPVKE where X is any amino acid (SEQ ID NO: 
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6). In the three-letter code, this amino add sequence is represented as LeuSerValXaaLeuGlylleProValLysGlu, whereby 
"Xaa" at position 4 is any amino acid. In a nrost preferred embodiment the X is an R residue. 
[0046] The characterization of the E681 K mutation described herein identified a region in the DNA polymerase gene 
that affects the ability of the polymerase to interact with negatively charged fluorescent nucleotides. This site, distal to 

5 helix O. is at the end of the Oa helix and the beginning of the Ob helix of the polymerase (Kim. et aL, 1995. NatM^e. 
226:612). Based on molecular modeling principles well-known in the art. changes in the structure of the Oa-Ob helix 
other than E to K at position 681 are also expected to produce changes in the ability of the polymerase to discriminate 
against nucleotides labeled with fluorescein family dyes. Thus, mutations at positions in the critical motif other than 
those in the X residue at position 4 are also within the scope of this invention. In this embodiment, the invention provides 

10 a recombinant thermostable DNA polymerase enzyme which is characterized in that (a) in its native form, the polymer- 
ase comprises the amino acid sequence I^XXLX(V/I)PXXE (SEQ ID NO: 1) where X is any amino ackJ, (b) the recom- 
binant polymerase comprises at least one mutation within the amino add sequence, except that X at position 4 is not 
mutated to E. and c) the enzyme has reduced discrimination against incorporation of nucleotides labeled with fluores- 
cein family dyes, in comparison to the coresponding native enzyme. 

15 [0047] Similarly, thermostable DNA polymerases that conrprise aitical motifs that are similar, but not identical to the 
critical motif that is amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 7) where X at position 4 is any amino acid 
except E, are within the scope of this invention. Specifically, in one embodiment, the critical motif is the amino acid 
sequence LXXXXXXXXXE (SEQ ID NO: 8) where X at position 4 is any amino acid except E. In the three letter code, 
this amino acid sequence is represented as LeuXaaXaaXaaXaaXaaXaaXaaXaaXaaGlu (SEQ ID NO: 8). whereby 

20 "Xaa" at positions 2. 3. 5. 6. 7. 8. 9 and 10 are any amino acid and "Xaa" at position 4 is any amino acid except Glu. 
[0048] In another embodiment, the critical motif is amino acid sequence L(S/A)XX(LyOXXXXXE (SEQ ID NO: 9) where 
X at position 4 is any amino add except E. In the three-letter code, this amino acid sequence is represented as LeuX- 
aaXaaXaaXaaXaaXaaXaaXaaXaaGlu (SEQ ID NO: 9). whereby "Xaa" at positions 3, 6. 7, 8. 9, and 10 are any amino 
acid. "Xaa" at position 2 is Ser or Ala. "Xaa" at position 4 is any amino add except Glu. and "Xaa" at position 5 is Leu 

25 or lie. 

[0049] In yet another embodiment, the aitical motif is amino acid sequence LSXXLXXXXXE (SEQ ID NO: 10) where 
X at position 4 is any amino add except E. In the three-letter code, this amino acid sequence is represented as LeuS- 
erXaaXaaLeuXaaXaaXaaXaaXaaGlu (SEQ ID NO: 10). whereby "Xaa" at positions 3. 6. 7, 8. 9. and 10 are any amino 
acid and "Xaa" at position 4 is any amino acid except Glu. 

30 [0050] The ability of the enzymes of this invention to efficientiy incorporate nucleotides labeled with fluorescein family 
dyes is measured by ddNTP incorporation assays. One such assay is a primer extension competition assay conducted 
under conditions of limiting tenplate. In this assay, a primer DG48 (5'-GGGAAGGGCGATCGGTGCGGGCCTCT- 
TCGC). (SEQ ID NO: 1 1), bound to f^13mp18 template (Innis et aL, 1988, Proc. fM. Apad. gg, USA 65:9436) is 
extended in the presence of [a-^P]dCTP and excess enzyme with various levels of a fluorescently labeled ddNTP, 

35 Zowie<klCTR Because the incorporation of a ddCTP residue terminates the extension reaction, the more readily a 
DNA polymerase incorporates a ddCTP into an extended primer, the less [a-^PJdCTP can be incorporated. Thus, as 
the effidency of fluorescently labeled ddCTP incorporation increases, the extent of inhibition of DNA synthesis is 
Increased. The reaction were also performed with various levels of an unlabeled ddCTP. The concentrations of ddCTP 
and Zowie-ddCTP needed for 50% inhibition were calculated and conpared to give a relative measure of the ability of 

40 the enzyme to incorporate the f luorescently-Jabeled nucleotide. The details of the ddNTP incorporation assay are pro- 
vided in Example II. 

[0051 ] Thus, in one embodiment of the invention, the characteristic of reduced disaimination against incorporation of 
nudeotides labeled with fluorescein family dyes is measured by the fluorescent ddNTP incorporation assay described 
in Exanple II. In a preferred enixxliment. the concentration of a ddNTP labeled with a fluorescein dye. Zowie-ddCTP, 
45 required for 50% inhibition of DNA synthesis is reduced at least 3-fold fa a mutant enzyme of the invention, relative to 
the wild-type enzyme. In a more preferred embodiment, the concentration is reduced at least 5-fold. In a most preferred 
embodiment, the concentration is reduced at least 10-fold. In another embodiment the characteristic of reduced dis- 
crimination is assayed by measuring fluorescent dNTP incaporation. 

[0052] In another aspect of the invention, the thermostable DNA polymerase gene sequence and enzyme are derived 
so from various thernrx>philic species. In one embodiment, the polymerase gene sequence and enzyme are from a species 
of the genus Thermus, In other enrtxxjiments of the invention, the gene sequence and enzyme are from thermophilic 
species other than Thermus. The full nudeic add and amino add sequence fbr numerous thermostable DNA polymer- 
ases are available. The sequences each of Tap, Thermus thermopilus (Tth), Thermus species 205, Thermus species 
6ps17. Thermotoga maritima (Tma). and Thermosipho africanus (7a/) polymerase have been published in PCT Inter- 
55 national Patent Application No. PGTAJ.S.91A)7035 which published as POT Patent Publication No. WO 92/06200 on 
April 16. 1992, and is incorporated herein by reference. The sequences for the DNA polymerase from Thermus fiavus. 
Bacillus caldotenax, and Bacillus stearothermophilus have been published in Akhmetzjanov and Vakhitcv. 1992. 
NudfiiC Adds BjBSfifltCh 2Q (21):5839, Uemori et a!., 1993, iL BiodbfillL 113:401-410, and as accession number 
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BSU23149.ng from the NG:New GenBank database, respectively, which are each incorporated herein by reference. 
The sequence of the thernrtostable DNA polymerase from Thermus caldophilus is found in EMBUGenBank Accession 
No. U62584. The sequence of the thernrwstable DNA polymerase from Thermus filiformis can be recovered from ATCC 
Deposit No. 42380 using the methods provided in U.S. Patent No. 4.889,818. as well as the sequence information pro- 
5 vided in Table 1 . The sequence of the Thermotoga neapolitana DNA polymerase is from GeneSeq Patent Data Base 
Accession No. R98144 (also disclosed in PCT WO 97A)9451). 

Table I 

Organism Critical Motif Critical Amino Acid 

Position 



J5 


Consensus 


L 


S/a 






L/i 






- - . E 






Thermus aquaiicus 


L 


S 


Q 


E 


L 


A 


I 


P Y E E 


681 




Thermus flavin 


L 


S 


G 


E 


L 


S 


I 


P Y E E 


679 


20 


Thermus thermophilus 


L 


S 


Q 


E 


L 


A 


I 


P Y E E 


683 




Thermus specie Z05 


L 


S 


Q 


E 


L 


A 


I 


P Y E E 


683 




Thermus specie spsl 7 


L 


S 


Q 


E 


L 


S 


I 


P Y E E 


679 


25 


Thermus caldophilus 


L 


S 


Q 


E 


L 


A 


I 


P Y E E 


683 




Thermus filiformis 


L 


S 


Q 


E 


L 


S 


I 


P Y E E 


679 


30 


Thermotoga maritima 


L 


S 


V 


R 


L 


G 


V 


P V K E 


744 


Thermotoga neapolitana 


L 


S 


V 


R 


L 


G 


I 


P V K E 


744 




Thermosipho qfricanus 


L 


S 


K 


R 


I 


G 


L 


S V S E 


743 


35 


Bacillus caldotenax^ 


L 


A 


Q 


N 


L 


N 


I 


S R K E 


725, 724 




Bacillus stearothermophilus'^ L 


A 


Q 


N 


L 


N 


I 


T R K E 


724, 727, 802 



1. Protein sequence from Accession No. D12982, Uemori T., Ishino Y., Fujita K., Asadi 
K., Kato I. "Cloning of the DNA polymerase gene of Bacillus caldotenax anc 
characterization of the gene product" L Biochcm. 113:401 (1993). The critical residue ii 
that sequence is 725. An almost identical protein sequence is provided as a putativ< 
Bacillus stearothermophilus'* DNA Polymerase in Accession No, R45155 and WPI 93 
408323/51. The critical residue in that sequence is 724. 



55 
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2. There are several sequence submissions for Bacillus stearothermophilus DNA 
polymerase in the GeneBank, or SwissProt/PIR databases. Although these sequences are 
highly related, but somewhat different from one another, each contains the identical 
L(S/A)XX(Lyi)XXXXXE (SEQ ED NO: 9) motif, where X is any amino acid except E. In 
the three-letter code, this amino acid sequence is represented as 
UuXaaXaaXaaXaaXaaXaaXaaXaaXaaGIu (SEQ ID NO: 9), whereby "Xaa" at positions 3, 
6. 7, 8, 9, and 10 are any amino acid. "Xaa" at position 2 is Ser or Ala, "Xaa" at position 4 
is any amino acid except Glu, and "Xaa" at position 5 is Leu or He. 



In the table above, protein sequences comprising the Critical Residue in the Critical Motif at position 724 are provided 
by Japanese patent publication J 05 304 964A, EP No. 699.760, and Accession No. U33536. Another highly related, 
but somewhat different, protein sequence was published in Senfi 164:65-68(1 995). contains the Critical Residue in the 
Critical Motif at position 727. Another highly related, but somewhat different, protein sequence, Accession No. U23149. 
for Bst DNA polymerase contains the Critical Residue in the Critical Motif at position 802. 

[0053] Because the DNA polymerases of each thermophilic species are unique, the amino acid position of the region 
of aiticality is distinct for each enzyme. Amino acid and nucleic add sequence alignment programs are readily available 
and, given the particular region identified herein, serve to assist in the identification of the exact sequence region of the 
invention. Such sequence alignment programs are available from the Genetics Conputer Group, 575 Science Drive. 
Madison, Wisconsin. Given the particular motif identified herein, these programs, including "GAP." "BESTFIT." and 
"PILEUP." serve to assist in the localization of the critical motif. The position of the regions of criticality are shown in 
Table I for thermostable DNA polymerases from exemplary thermophilic species. 

[0054] Regardless of the exact position of the critical motif LSXXIJ((V/I)PXXE (SEQ ID NO: 7), where X at position 4 
is any amino acid except E. within the polymerase domain of a thernrostable DNA polymerase, the presence of the motif 
serves to provide thermostable DNA polymerases having the ability to efficiently incorporate nucleotides labeled with 
fluorescein family dyes. TTierefore. mutation of the conserved glutamic acid of the thermostable DNA polymerases of 
Therms flavus (Glu 679), Thermus thermophilus (Gu 683). Thermus species Z05 (Gu 683), Thermus species sps1 7 
(Gu 679) Thermus caldophilus (Glu 683), Thermus filiformis (Glu 679) to produce the critical motif will provide an 
enhancing effect on the aWlity of the polymerase to efficiently incorporate nucleotides labeled with fluorescein family 
dyes. 

[0055] In addition, in view of the highly conserved nature of the now identified critical motif, novel thermostable DNA 
polymerases nnay be identified based upon their homology to. for example. Taq DNA polymerase or the sequences of 
other DNA polymerases in Table I (see for example U.S. Patent Nos, 5.618.711 and 5,624,833 which are herein Incor- 
porated by reference). Such polymerases, so long as their peptide sequence is at least 45% and nr>ost preferably 
greater than 80% homologous to the Taq polymerase amino add sequence, as detarmined by the methods described 
herein, are within the scope of the present invention. Consequently, the invention relates to a class of enzymes which 
also includes, for example, the thermostable DNA polymerase, and corresponcfing gene and expression vectors from 
Thermus oshimai (Williams RA, et a/. 1996 M i Sysl Bacteriol 46 (2): 403^8); Thermus silvanus and Thermus 
chliarophilus (Terxcelro S, efal.. 1995. Int. J. Syst. Bacteriol 45 (4^: 633-639): Thermus scotoductus (Venr eiro S et al., 
1995. Bfis^MiCCAiQl J4£(4): 315-324): Thermus ruber MCCZ59AB. (LG. Loginova. 1984. JnL»LS^fiaclfiriQl3i: 
498-499): and Therms brockianus (Munster. M.J.. 1986, J, Gen. Microbiol 132: 1677), which publications are each 
incorporated herein by reference. 

(0056] Those of skill in the art will recognize that the above thermostable DNA polymerases with enhanced efficiency 
fa incorporating f luorescein-labeled nucleotides are most easily constructed by reconi)inant DNA techniques such as 
site-directed mutagenesis. See for example Sambrook et ah, Molecular Clooioa: A Lfl'?Qfi^^^''Y MaouaL Cold Spring 
HartKX, 1989. second edition, chapter 15.51. "Oligonudeotide-mediated mutagenesis," which is incorporated herein by 
reference. This technique is now standard in the art and can be used to create all possible classes of base pair 
changes at any deternnined site in a gene. The method is performed using a synthetic oligonucleotide primer comple- 
mentary to a single-stranded phage or plasmid DNA to be mutagenized except for a limited nnismatching, which repre- 
sents the desired mutation. Briefly, the synthetic oligonucleotide is used as a primer to direct synthesis of a strand 
conplementary to the phage or plasmid. and the resulting double-stranded DNA is transformed into a phage- or plas- 
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mid-supporting host bacterium. The resulting bacteria can be assayed by, for example, DNA sequence analysis or 
probe hybridization to identify those plaques or colonies carrying the desired mutated gene sequence. 
[0057] Subsequent to the Invention of PGR. primer-directed mutagenesis (described in U.S. Patent No. 4.683,195. 
which Is herein incorporated by reference) and "overlap PGR" (Higuchl. 1989, in ECB Technology, ed. Erlich. Stockton 
Press, New YorK NY, pp.61 -70) have become routine n^eans of introducing any mutation at any position of a gene. 
(00581 The mutated DNA can be recovered from the plasmld. phasmid. phage or amplification reaction by conven- 
tional means and ligated into an expression vector for subsequent culture and purification of the resulting enzyme. 
Numerous cloning and expression vectors are suitable for practicing the invention, including nwrnmalian and bacterial 
systems, as described in, for example. Sambrook a/.. 1989 supra. For convenience, the present invention is exem- 
plified utilizing the lambda derived P^ promoter (Shimatake et al., 1981 . NatuTfi 222:128). Use of this promoter Is spe- 
cifically described in U.S. Patent Nos. 4.71 1 ,845 and 5,079,352, which are incorporated herein by reference. 
[0059] Plasmid pCSI has been deposited with the ATCG, on August 28, 1997, and given accession No. 98521. This 
plasmid contains a gene encoding a thernDostaWe DNA polymerase which gene is mutated at the codon at position 681 
such that glutamic acid is replaced with lysine in the resulting polypeptide and provides a means for providing ther- 
nfwstaWe DNA polymerases having an enhanced efficiency for incorporating nucleotkles labeled with fluorescein family 
dyes. Example I illustrates the use of flanking restriction sites suitable for subcloning the E681K mutation to create other 
thermostable DNA polymerase enzymes. Alternatively, because the complete gene sequence for numerous thermosta- 
ble DNA polymerases are known, other means for introducing the E681K mutation, such as restriction digestion and 
fragment replacement, are readily available to those of skill in the art. having the availability of ATCG deposits and the 
sequence information provided herein. 

[0060] When one desires to produce one of the mutant or native enzymes of the present invention, a a derivative or 
homologue of those enzymes, the production of the enzyme typically involves the transformation of a host cell with the 
expression vector, and culture of the transformed host cell under conditions such that expression will occur. Means for 
transforming and culturing transformed host cells are well known in the art and are desaibed in detail in, for example. 
Sambrook et al., 1989. supra. 

[0061] The thermostable DNA polymerases of the present invention are generally purified from E. coli strain DG1 16 
(deposited as ATCG 53606 on April 7. 1987) which has been transformed with an expression vector operaWy linked to 
a gene encoding a wild-type or modified thermostable DNA polymerase. Methods for purifying the thermostable DNA 
polymerase are described in. for example, Example I and Lawyer et aL, 1993, PGR Methods and Applications 2:275- 
87, which is incorporated herein by reference. 

[0062] The thernx)stable enzymes of the invention may be used for any purpose in which such enzyme activity is nec- 
essary or desired. Exanrtples of uses include DNA sequencing, DNA labeling, and labeling of primer extension products. 
DNA sequencing by the Sanger dideoxynudeotide method (Sanger et at, 1 977, Proc. NaJL Acqd. SsL lA' 5463) is par- 
ticularly improved by the present invention. Advances in the basic Sanger et al. method have provided novel vectors 
(>fenisch-Perton et al, 1985 Gene 53:103-119) and base analogues (Mills et al., 1979, Proc. Natl. Aga£LSgLZS:2232- 
2235, and Ban- et al., 1986, Biotechniques 4;428-432). In general. DNA sequencing requires template-dependent 
primer extension In the presence of chain-terminating base analogs, resulting in a distribution of partial fragments which 
are subsequently separated by size. The basic dideoxy sequencing procedure involves (0 annealing an oligonucleotide 
primer, optionally labeled, to a template; (ii) extending the primer with DNA polymerase in four separate reactions, each 
containing a mixture of unlabeled dNTPe and a limiting amount of one chain terminating agent such as a ddNTP, optk)n- 
ally labeled; and (iii) resolving the four sets of reaction products on a high-resolution denaturing polyacrylamideAjrea 
gel. The reaction products can be detected in the gel by autoradiography or by fluorescence detection, depending on 
the label used, and the Image can be examined to infer the nucleotide sequence. These methods utilize DNA polymer- 
ase such as the Klenow fragment of E coli Pol I or a modified T7 DNA polymerase. 

[0063] The availability of thernwresistant polymerases, such as Taq DNA polymerase, resulted in improved methods 
for sequencing with thermostable DNA polymerase (see Innis et al., 1988, supra) and modifications thereof referred to 
as "cyde sequencing" (Murray, 1989. Niffi Acids Res. 11:8889). As an alternative to basic dideoxy sequencing, cycle 
sequencing is a linear, asymmetric amplification of target sequences complementary to the template sequence in the 
presence of chain terminators. A single cycle produces a family of extension products of all possible lengths. Following 
denaturation of the extenston reactton product from the DNA template, multiple cycles of primer annealing and primer 
extension occur in the presence of terminators such as ddNTPs. Cyde sequencing requires less template DNA than 
conventional chain-termination sequencing. Thermostable DNA polymerases have several advantages in cyde 
sequendng; they tolerate the stringent annealing temperatures which are required for specific hybridization of primer to 
nudeic acid targets as well as tderating the multiple cycles of high temperature denaturation which occur in each cyde. 
i.e.. 90-95* C. For this reason, AnpHTaq® DNA Polymerase and its derivatives and descendants have been induded 
in Taq cycle sequencing kits commerdalized by companies such as Pertdn-Elmer, NorwalK CT. 
[0064] Two variations of chain termination sequendng methods exist - dye-primer sequendng and dye-terminator 
sequendng. In dye-primer sequencing, the ddNTP terminators are unlabeled, and a labeled primer is utilized to detect 
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extension products (Smith et al., 1986, Nature 32 :674-679^ In dye-terniinator DNA sequencing, a DNA polymerase is 
used to Incorporate dNTPs and fluorescently labeled ddNTPs onto the end of a DNA primer (Lee et a/., supra,). This 
process offers the advantage of not having to synthesize dye labeled primers. Furthermore, dye-terminator reactions 
are more convenient in that all four reactions can be performed in the same tube. 

5 [0065] Both dye-primer and dye-terminator methods may be automated using an automated sequencing instrument 
produced by Applied Biosystems, Foster City, CA. (U.S. Patent No. 5.171.534. which is herein incorporated by refer- 
ence). When using the instrument, the completed sequencing reaction mixture is fractionated on a denaturing polyacr- 
ylamide gel nrwunted in the instrument. A laser at the bottom of the instrument detects the fluorescent products as they 
are electrophoresed according to size through the gel. 

10 [0066] Two types of fluorescent dyes are commonly used to label the terminators used for dye-terminator sequencing 
- negatively charged and zwitterionic fluorescent dyes. Negatively charged fluorescent dyes include those of the fluo- 
rescein and BODIPY families. BODIPY dyes {4,4-difluoro-4-bora-3a,4a-diaza-s-indacene) are described in Interna- 
tional Patent Application. Publication No. WO 97/00967. which is incorporated herein by reference. Zwitterionic 
fluorescent dyes include those of the rhodamine family. Ck>mmercially available cycle sequencing kits use terminators 

15 labeled with rhodamine derivatives. However, the rhodamine-labeted terminators are rather costly and the product must 
be separated from unincorporated dye-ddNTPs before loading on the gel since they co-migrate with the sequencing 
products. Rhodamine dye family terminators seem to stabilize hairpin structures in GC-rich regions, which causes the 
products to migrate anomalously. This requires the use of dlTP which relaxes the secondary structure but also affects 
the efficiency of irxx>rporation of terminator. 

20 [0067] In contrast f luorescein-labeled terminators eliminate the separation step prior to gel loading since they have 
a greater net negative charge and migrate faster than the sequencing products. In addition, fluorescein-labeled 
sequencing products have better electrophoretic migration than sequencing products labelled with rhodamine. 
Although wild-type Taq DNA polymerase does not efficiently incorporate terminators labeled with fluorescein family 
dyes, this can now be accomplished efficiently by use of the modified enzymes provided herein. 

25 [0068] Thus, the scope of this invention includes novel methods for dideoxy sequencing using enzymes having the 
critical motif, as well as kits for performing the method. In one embodiment, the sequencing method of the invention 
conrprises: 

a) providing a recombinant thermostable DNA polymerase enzyme which is characterized in that 

30 

i) in its native form said polymerase comprises the amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 
1).where X is any amino acid. 

ii) the X at position 4 in said sequence is nrxjtated in comparison to said native sequence, except that X is not 
mutated to E; and 

35 iii) said thermostable DNA polymerase has reduced discrimination against incorporation of nucleotides labeled 

with fluorescein family dyes in comparison to the native form of said enzyme; and 

b) performing a dye-terminator sequencing reaction. 

40 [0069] In a prefenred embodiment of the above method, the native form enzyme has the amino acid sequence 
LS(C5/G)XL(S/A)IPYEE (SEQ ID NO: 2), where X is any amino add. In the three-letter code, this amino acid sequence 
is represented as LeuSerXaaXaaLeuXaalleProTyrGluGlu (SEQ ID NO: 2), whereby "Xaa" at position 3 is Gin or Gly. 
"Xaa" at position 4 is any amino acid, and "Xaa" at position 6 is Ser or Ala. In a more preferred embodiment, the native 
fam amino acid sequence is LSQXLAIPYEE (SEQ ID N0.-3), where X is any amino acid. In the three-letter code, this 

45 amino add sequence is represented as LeuSerGlnXaaLeuAlalleProTyrGIuGu (SEQ ID N0:3). whereby "Xaa" at posi- 
tion 4 is any amino add. In a most preferred embodiment, the "Xaa" at position 4 is Lys. 

[0070] As desaibed above. DNA sequendng with thermostable DNA polymerases requires a mixture of unconven- 
tional base analogues that act as chain-terminators and conventional nucleotides at a specified ratio of concentrations 
that insures that a population of extension products wouW be generated representing all possible fragment lengths over 

50 a distance of several hundred bases. Some thermostable DNA polymerases previously used for sequencing, such as 
wild-type Taq polymerase, are characterized in that they preferentially incorporate conventional nudeotides in the pres- 
ence of a mixture of conventional and unconventional nudeotides. However, some recently described thermostable 
DNA pdymerases allow the ratio of unconventional base analogues to conventional bases to be reduced from a hun- 
dred to several hundred fold, or up to over a thousand fold. 

55 [0071] One such polymerase is the F667Y mutant of Taq DNA polymerase. Another such mutant is a Taq DNA 
polymerase having an F667Y mutation and a mutation at position 46 which changes a glycine residue to an aspartic 
acid residue (G46D) mutation. This mutant polymerase, known as Ampli Taq. FS. is manufactured by Hoffmann-La 
Roche and marketed by PerWn-Elmer. F730Yrma30 DNA Pdymerase is another such polymerase. This mutant 



11 



EP0902 0a5A2 



polymerase is a combination of 1) nudeotides 1-570 of Taq DNA polymerase modified to encxxie a G46D mutation and 
2) nucleotides 571-2679 of Tma DNA polymerase modified to encode an aspartic acid to alanine mutation at position 
323, a glutamic acid to alanine mutation at position 325, and a phenylalanine to tyrosine mutation at position 730 (U.S. 
Patent Application Serial No. 60^052065 and European Patent Application No. 98112327.6, both hereby incorporated 
by reference). Another polymerase that incorporates unconventional base analogues is a F730Y mutant DNA polymer- 
ase from Thermotoga neapolitana (International Patent Application. Publication Nos. WO 96/1 0640. WO 96/41 01 4, and 
WO 97/09451 , which are hereby incorporated by reference). Using these enzymes, for a given dNTP concentration, the 
rhodamine-ddNTP concentration can be decreased by about 50- to several hundred-fold compared to thermostable 
DNA polymerases previously available. 

[0072] The E681 K mutation of the invention was conf*)ined using recombinant DNA methods with an F667Y mutation 
to produce the double mutant Tag DNA polymerase enzyme used in the sequencing reactions described in Example 
IV. The double mutant was used in a dye-terminator sequencing reaction with fluorescein-labeled dye terminators. The 
results, described in Exanrple IV, show that the enzyme is capable of incorporating fluorescein-labeled dye terminators 
in a sequencing reaction and produces sequencing ladders that can be accurately read in an automated sequencing 
instrument. Unexpectedly, the combination of the E681K and the F667Y mutations was also found to produce a ther- 
mostable DNA polymerase enzyme with a 3- to 4-fold increased extension rate relative to an enzyme with the F667Y 
mutation alone, as measured by the assay described in Example HI. 

[0073] Thus, in another aspect of this invention, the critical motif identified in this invention can be combined with 
motifs confemng reduced discrimination against ddNTPs to produce polymerases having an increased efficiency of 
incorporation of both labeled and unlabeled ddNTPs. These polymerases are useful in DNA sequencing methods. In 
one embodiment of the present invention, a thernrK)stable DNA polymerase having the critical motif defined herein also 
comprises the critical motif that includes the F667Y mutation, described in U.S. Serial No 08/448,223. In this embodi- 
ment, the thermostable DNA polymerase is characterized in that 

i) in its native form said polymerase comprises a first amino acid sequence LSXXLX(V/1)PXXE (SEQ ID NO: 1). 
where X is any amino acid, 

ii) the X at position 4 in said first amino acid sequence is mutated in comparison to said native sequence, except 
that X is not mutated to E; and 

iii) said thermostable DNA polymerase has reduced discrimination against incorporation of nucleotides labeled with 
fluorescein family dyes in comparison to the native form of said enzyme; and 

iv) said polymerase comprises a second amino acid sequence MRRXXKXXNYXXXYG (SEQ ID NO: 12) where X 
is any amino acid; 

v) said thermostable DNA polymerase also has reduced discrimination against incorporation of unconventional 
nucleotides in comparison to the native form of said enzyme. In the three-letter code, the second amino acid 
sequence is represented by MetArgArgXaaXaaLysXaaXaaAsnTyrXaaXaaXaaTyrGly (SEQ ID NO: 12), where 
"Xaa" at positions 4. 5, 7. 8, 1 1. 12, and 13 is any amino add. In a prefenred entixdiment the "Xaa" at position 4 
in the first amino add sequence is mutated to Lys. In a more preferred embodiment, the enzyme Is Taq DNA 
polymerase and It conprises the E681 K and F667Y mutations. Also within the scope of this invention are methods 
of sequendng using the above polymerases. 

[0074] Also within the scope of this invention is the improved sequendng method of the invention perfomied using 
thermostable DNA polymerase enzymes having a aitical motif which is not derived by mutation, but which critical motif 
exists as a natural variant In this aspect, the DNA polymerase of a thermophilic bacterial species has a aitical motif in 
which the residue at position 4 is not Glu. For exanple. in the thermostable DNA polymerase from Thermotoga 
neapolitana, the X at position 4 in the motif LSXXLX(V/OPXXE (SEQ ID NO: 7), where X is any amino acid except E. Is 
an arginine residue. Thus, the invention provides for inproved methods of DNA sequendng using a native thernwstable 
DNA polymerase which comprises the amino add sequence LSXXU((V/I)PXXE (SEQ ID NO: 7) where X can be any 
amino acid except E. In the three-letter code, this amino acid sequence is represented as LeuSerXaaXaaLeuXaaXa- 
aProXaaXaaGlu (SEQ ID NO: 7), where "Xaa" at positions 3. 6, 9. and 10 are any amino add and "Xaa" at position 4 
Is any amino acid except Glu and "Xaa* at position 7 is Val or lie. In this embodiment, the sequendng method of the 
Invention comprises: 

a) providing a thermostable DNA polymerase which is characterized in that 

0 said polymerase comprises the amino acid sequence 

LSXXLX(V/I)PXXE (SEQ ID NO: 7). where X at position 4 is any amino acid except E, 

Ii) said tiiermostaWe DNA polymerase has reduced discrimination against incorporation of nudeotides labeled 

with fluorescein family dyes; and 
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b) providing a dye-terminator labeled with a fluorescein family dye. and 

c) performing a dye-terminator sequencing reaction. 

[0075] In a mae pretended embodiment, the sequencing method of the invention comprises: 

a) providing a thermostable DNA polymerase which is characterized in that 

i) said polymerase comprises a first amino add sequence 

LSXXLX(V/I)PXXE (SEQ ID NO: 7). where X at position 4 is any amino acid except E, 

il) said thermostable DNA polymerase has reduced discrimination against incorporation of nucleotides labeled 

with fluorescein family dyes; 

iii) said polymerase comprises a second amino acid sequence MRRXXKXXNYXXXYG (SEQ ID NO; 12) where 
X is any amino add. 

iv) said thermostable DNA polymerase has reduced discrimination against incorporation of unconventional 
nucleotides; and 

b) providing a dye-terminator labeled with a fluorescein family dye. and 

c) performing a dye-terminator sequencing reaction. 

[0076] In another preferred embodiment, the enzyme contprises the amino add sequence LS{Q/G)XL(SyA)IPYEE 
(SEQ ID NO: 13). where X at position 4 is any amino acid except E. In the three-letter code, this amino acid sequence 
is represented as LeuSerXaaXaaLeuXaalleProTyrGluGlu (SEQ ID NO: 13) whereby "Xaa" at position 3 is Gin or Gly. 
"Xaa" at position 4 is any amino acid except GIu. and "Xaa" at position 6 is Ser or Ala. In a more preferred embodiment, 
the amino acid sequence is LSQXLAIPYEE (SEQ ID NO: 14), where X is any amino acid except E. In the three-letter 
code, this amino acid sequence is represented as LeuSerGlnXaaLeuAlalleProTyrGluGlu (SEQ ID NO: 14). whereby 
"Xaa" at position 4 is any amino acid except Gu. 

[0077] In yet another preferred embodiment the enzyme has the amino add sequence LSVXLG(V/I)PVKE (SEQ ID 
NO: 15). where X Is any amino acid except E. In the three-letter code, this amino acid sequence is represented as 
LeuSerValXaaLeuGlyXaaProValLysGlu (SEQ ID NO: 15). whereby "Xaa" at position 4 is any amino add except GIu and 
"Xaa" at position 7 is Val or lie. In a more preferred erribodiment the amino add sequence is LSVXLGVPVKE (SEQ ID 
NO: 16). where X is any amino acid except E. In the three-letter code, this amino acid sequence is represented as 
LeuSerValXaaLeuGlyValProValLysGlu (SEQ ID NO: 16), whereby ")(aa" at position 4 is any amino acid except GIu. In 
a most preferred embodiment, the "Xaa" at position 4 is Arg. In another rrrore preferred embodiment, the amino acid 
sequence is LSVXLGIPVKE (SEQ ID NO: 1 7), where X is any amino add except E. In the three-letter code, this amino 
acid sequence is represented as LeuSerValXaaLeuGlylleProValLysGlu (SEQ ID NO: 17), whereby "Xaa" at position 4 
is any amino acid except GIu. In a most preferred embodiment, the "Xaa" at position 4 is Arg. 
[0078] In another embodiment of the invention, the sequencing methods are performed using a native enzyme which 
has a reduced level of discrimination against nucleotides labeled with fluorescein family dyes which level is measured 
using a ddNTP Incorporation assay such as that desaibed in Example II. The concentration of ddCTP required fa 50% 
inhibition of DNA synthesis is determined, as is the concentration of Zowie-ddCTP needed for 50% inhibition. The ratio 
of the concentration for Zowie-ddCTP to the concentration for ddCTP is calculated. In a preferred embodiment, the ratio 
is 10 or less. In a more prefened embodiment, the ratio is 4 or less. In a most prefen-ed emtxKiiment, the ratio is 1.2 or 
less. 

[0079] Although the examples provided herein use dideoxynudeotides labeled with fluorescein fanrrily dyes, the use 
of other unconventional nudeotides and fluorescent dyes is also within the scope of this invention. Other unconven- 
tional nudeotides include fluorescently-laWed dNTPs. which can be used to label the products of DNA synthesis, and 
f luorescently-iabled rNTPs. which can be used to label the primer extension products. Other dyes include other nega- 
tively charged fluorescent dyes, such as BODIPY. which are structurally and chemically similar to fluorescein. Other 
dyes also indude cyanine dyes. Cyanine-labeled dNTPs were added to a standard PGR reaction which induded a Taq 
DNA polymerase with the E681K mutation (and a G46D mutation). The cyanine-labeled dNTPs were unexpectedly 
found to be incorporated into amplification products at a level that was higher than for the wild-type enzyme. Thus, In 
this aspect, a method of labeFtng DNA of the Invention uses a native or mutant polymerase of the invention in <»xnbina- 
tion wrth a nucleotide labeled with either a negatively charged fluorescent dye or a cyanine dye. In one enrtxxliment. 
the DNA labeling method of the invention comprises: 

a) providing a thermostable DNA polymerase characterized in that 

i) said polymerase comprises the amino acid sequence 
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LSXXLX(V/OPXXE (SEQ ID NO: 7) where X at position 4 can be any amino acid except E 

ii) said polymerase has reduced discrimination against Incorporation of unconventional nucleotides, and 

b) providing a nucleotide labeled with a negatively charged fluorescent dye. and 

c) performing a DNA synthesis reaction. 

(0080] In another emlxxjiment. the DNA labeling method of the invention comprises: 

a) providing a thermostable DNA polymerase characterized in that 

i) said polymerase comprises the amino acid sequence 

LSXXLX(V/I)PXXE (SEQ ID NO: 7) where X at position 4 can be any amino acid except E 

ii) said polymerase has reduced discrimination against incorporation of unconventional nucleotides, and 

b) providing a nucleotide labeled with a cyanine dye. and 

c) performing a DNA synthesis reaction. 

[0081] In another aspect of the invention, a thernx)stabie DNA polymerase is provided which combines a mutation 
allowing more efficient incorporation of rNTPs, such as the glutamic ackJ to glycine mutation at posrtion 615 of Taq DNA 
polymerase, and the critical motif of this invention. The resulting enzyme is expected to have an inaeased efficiency of 
incorporation of ribonucleotides labeled with fluorescein family dyes. Thus, in one embodiment, the invention provides 
a recorrtjinant thermostable DNA polymerase wNch (1) in its native form comprises the amino acid sequence 
LSXXLX(V/I)PXXE (SEQ ID NO: 1) where X is any amino acid and (2) the X at position 4 is mutated such that X at posi- 
tion 4 is not mutated to E and (3) also comprises the region of criticality which is amino acid sequence SQIXLR(V/I) 
(SEQ ID NO: 18) where "X" is any amino add except E. and (4) is capable of efficient incorporation of ribonucleotides 
labeled with fluorescein family dyes. In the three-letter code, the latter sequence is represented as SerGlnlleXaa- 
LeuArgXaa, where "Xaa** at position 4 is any amino acid except Glu and "Xaa" at position 7 is Val or lie 
[0082] Mutant polymerase domains such as that for Taq containing the E615G and E681K mutations are useful in 
improved methods of producing primer extension products labeled with fluorescein family dyes. For example, in a 
primer extension reaction such as PGR. rNTPs labeled with fluorescein family dyes are substituted at least partially for 
one of the 4 standard dNTPs and a double mutant polymerase such as E681K E615G Taq DNA polymerase is 
included. The mutant polymerase synthesizes primer extension products that have fluorescein-labeled ribonucleotide 
residues at various positions along their lengths. Upon heat or alkali treatment, the primer extension products are frag- 
mented at each ribonucleotide residue, producing a population of end-labeled fragments. This population of uniformly 
labeled fragments represents a distribution of the fluorescent label across the length of the primer extension product. 
Labeled fragments of these characteristics are useful in nucleic acid detection formats based on silicon chips, such as 
that of (Cronin et al, supra,). Thus, in one embodiment, the invention provides a method of labeling primer extension 
products which comprises (1) providing a thermostable DNA polymerase which (a) in its native form comprises the 
amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 1) where X is any amino add, (b) the X at position 4 is mutated 
such that X at position 4 is not mutated to E, (c) also comprises the region of criticality which is amino acid sequence 
SQIXLR(V/0 (SEQ ID NO: 18) where "X" is any amino add except E and (d) is capable of effident incorporation of rftx)- 
nudeotides labeled with fluorescein and/or cyanine family dyes, and (2) performing a primer extension reaction. 
[0083] In yet another aspect enzymes having the critical nnotif of this invention display an increased rate of extension 
relative to the wild-type enzyme as shown in Example IV for a E681K F667Y mutant In one embodiment, the enzyme 
is characterized in that (1) in its native form, it comprises the amino acid sequence LSXXLX{V/I)PXXE (SEQ ID NO: 1) 
where X is any anrvno add. (2) the amino add sequence is mutated at position 4 such that X at posrtion 4 is not mutated 
to E. and (3) it has an increased extension rate relative to the wild-type enzyme, in a prefened enrixxliment. the enzyme 
is characterized in that (1) in its native form, it comprises the amino acid sequence LSXXLX(V/OPXXE (SEQ ID NO: 1) 
where X is any amino acid, (2) the amino add sequence is mutated at position 4 such that X at position 4 is not mutated 
to E and (3) it also comprises the amino add sequence MRRXXKXXNYXXXYG (SEQ ID NO: 12) where X is any amino 
acid and (4) has an increased extension rate. In a more preferred embodiment, the enzyme is characterized in that (1) 
in its native form it contains the amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 1) where X is any amino acid 
and (2) the amino acid sequence is mutated at position 4 such that X at position 4 is mutated to K. In a most prefen-ed 
embodiment, the enzyme is Taq DNA polymerase and contains the E681K mutation and the F667Y mutation. Also 
induded within this aspect are methods of sequencing and labeling of DNA using the polymerases with inaeased 
extension rate as well as kits for doing the same. 

[0084] In a preferred method for DNA sequendng according to the invention, thermostable pyrophosphatase is 
induded in the reaction mixture. Pyrophosphatase has been shown to enhance sequendng data using mesophilic as 
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well as the mutant thermostable DNA polymerases described in European Patent Application. Publication No. EP-A- 
763 599 and U.S. Patent No. 5.665,551. 

[0085] In an exemplified embodiment, the thernrwstable DNA polymerase of the invention also contains a mutation in 
the 5*-nuclease domain that serves to greatly attenuate this nuclease activity. Modified forms of Taq polymerase have 
been described in PCT Patent Publication No. WO 92/06200. published April 16. 1992 and in U.S. Patent No. 
5.466.591 . In one embodiment of that invention, the codon for the glycine residue at amino acid position 46 has been 
replaced with a codon for aspartic add (Q46D mutation). The resulting enzyme has enhanced utility in cycle sequenc- 
ing reactions due to the decreased 5'-nuclea&e activity. The polymerase domain amino acid sequence and polymerase 
activity are both unchanged in the G46D mutant In comparison to the wild-type enzyme. 

(0086] In a commercial embodiment of the invention, kits for practicing methods that are improved by use of the 
present invention are considered to be an additional aspect of the invention. One such Wt for DNA sequencing com- 
prises 

a) a thermostable DNA polymerase characterized in that 

i) said polymerase comprises the amino acid sequence 

LSXXU(V/I)PXXE (SEQ ID NO: 7) where X at position 4 can be any amino acid except E 

ii) said polymerase has reduced discrimination against incorporation of nucleotides labeled with fluorescein 
family dyes, and 

b) a dye-terminator labeled with a negatively charged fluorescent dye and may additionally include other reagents 
for DNA sequencing such as dNTPs. thernnostable pyrophosphatase and appropriate buffers. In another embodi- 
ment, the enzyme in the kit has the amino acid sequence LSVXLG(V/I)PVKE (SEQ ID NO: 15), whereby X is any 
amino acid except E. In the three-letter code, this amino add sequence is represented as LeuSerValXaaLeuGlyX- 
aaProValLysGlu (SEQ ID NO: 15), whereby "Xaa" at position 4 is any amino acid except Giu and "Xaa" at position 
7 is Val or lie. In a preferred embodiment, the amino acid sequence is LSVXLGVPVKE (SEQ ID NO: 16). where X 
is any amino acid except E. In the three-letter code, this amino add sequence is represented as LeuSerValXaa- 
LeuGlyValProValLysGlu (SEQ ID NO: 16), whereby "Xaa" at position 4 is any amino acid except E. In a more pre- 
ferred embodiment the "Xaa** at position 4 is Arg. In another preferred embodiment, the amino add sequence is 
LSVXLGIPVKE (SEQ ID NO: 17) where X is any amino add except E. In the three-letter code, this amino acid 
sequence is represented as LeuSerValXaaLeuGlylleProValLysGlu (SEQ ID NO: 17). whereby "Xaa" at position 4 
is any amino acid except Glu. In a more prefen-ed embodiment the "Xaa" at position 4 is Arg, 

[0087] Other kits for DNA sequendng comprise a mutant thermostable DNA polymerase characterized in that 

a) in its native form said polymerase comprises the amino add sequence LSXXLX(V/I)PXXE (SEQ ID NO: 1) 
where X is any amino acid. 

b) ^d amino acid sequence is mutated at position 4. except that X at position 4 is not mutated to E; and 

c) said th6rnx)stable DNA polymerase has reduced discrimination against incorporation of nudeotides labeled with 
fluorescein family dyes in conrtparison to the native form of said enzyme and may additionally tndude reagents for 
DNA sequencing such as chain terminating compounds, dNTPs, thermostable pyrophosphatase and appropriate 
buffers. Where the terminators are labeled, preferable laljels are fluorescent dyes, nrwre preferable labels are neg- 
atively charged fluorescent dyes or cyanine dyes, and the most preferable labels are fluorescein family dyes. In a 
preferred embodiment the enzyme in the kit has the amino add sequence LS(Q/G)XL(S/A)IPYEE (SEQ ID NO: 2). 
where X is any amino acid. In the three-letter code, this amino acid sequence is represented as LeuSerXaaXaa- 
LeuXaalleProTyrGluQIu (SEQ ID NO: 2), v^rtiereby "Xaa" at position 3 is Gin or Qly, "Xaa" at position 4 is any amino 
acid, and "Xaa" at position 6 is Ser or Ala. In a more preferred embodiment, the amino acid sequence is LSQX- 
LAIPYEE (SEQ ID N0:3), where X is any amino add. In the three-letter code, this amino acid sequence is repre- 
sented as LeuSerGlnXaaLeuAlalleProTyrGluGlu (SEQ ID N0:3), whereby "Xaa" at position 4 is any amino acid. In 
a most preferred embodiment, the "Xaa" at position 4 is Lys. 

[0088] Kits for labeFing DNA comprise a thermostable DNA pdynnerase whfch is characterized in that (a) in Its native 
form, the polymerase comprises the amino acid sequence LSXXLX(V/I)PXXE (SEQ ID NO: 1) where X is any an^no 
acid, (b) the X at position 4 in said sequence is mutated in comparison to said native form, except that X at position 4 
is not mutated to E, and c) the enzyme has reduced discrimination against incorporation of nudeotides labeled with flu- 
orescein family dyes, in comparison to the corresponding wild-type enzyme and may additionally indude dNTPs and 
appropriate buffers. In a prefeaed embodiment the X at position 4 is mutated to K. Other kits for produdng labeled DNA 
comprise a) a nucleotide or nucleotide analog labeled with a negatively charged fluorescent compound and b) a native 
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thermostable DNA polymerases having the following crKicat motif: 

LSXXLX(V/I)PXXE (SEQ ID NO: 7) 
where X at position 4 can be any amino acid except E and said polymerase has reduced disaimination against Incor- 
poration of fluorescein-labeled nucleotides, and may additionally include dNTPs and appropriate buffers. In a preferred 
embodiment, the X at position 4 is K. In another preferred embodiment, the X at position 4 is R. 
(0089] Kits for labeling primer extension products comprise a thermostable DNA polymerase which is characterized 
in that (a) in its native form, the polymerase comprises the amino acid sequence LSXXLX{V/I)PXXE (SEQ ID NO: 1) 
where X is any amino acid, (b) the X at position 4 in said sequence is mutated in comparison to said native sequence, 
except that X at position 4 is not mutated to E. c) the polymerase also connprises the second amino acid sequence 
SQIXLR(V/I) (SEQ ID NO: 18) where "X" is any amino acid except E. d) the enzyme has reduced discrimination against 
incorporation of ribonucleotides labeled with fluorescein family dyes, in comparison to the corresponding wild-type 
enzyme, and may additionally include a ribonucleotide or ribonucleotide analog labeled with a negatively charged fluo- 
rescent connpound or cyanine connpound, dNTPs, and appropriate buffers. In a preferred embodiment the polymerase 
contains a E681 K mutation arxl a E615G mutation. Other kits for producing labeled primer extension products comprise 
a) a rbonucleotide or ribonucleotide analog labeled with a negatively charged fluorescent compound or cyanine com- 
pound and b) a native thermostable DNA polymerase characterized in that it (i) corrprises the critical motif which is the 
amino acid sequence: 

LSXXU(V/I)PXXE (SEQ ID NO: 7) 
where X at position 4 can be any amino acid except E, (ii) comprises the second amino acid sequence SQIXLR(V/I) 
where "X" is any amino acid except E, and (iii) has reduced discrimination against incorporation of fluorescein-labeled 
ribonucleotides, and may additionally include dNTPs and appropriate buffers. In a prefen-ed embodiment, in the first 
amino acid sequence, the X at position 4 Is a K and in the second amino add sequence, the X is a G. In another pre- 
ferred embodiment, in the first amino acid sequence, the X at position 4 is a R and in the second amino acid sequence, 
the X is a G. 

[0090] The following examples are offered by way of illustration only and are by no means Intended to limit the scope 
of the claimed invention. 

Expression of a Modified Taa Polymerase Gene Having Reduced Disaimination Aoainst Nucleotides Labeled with Ru- 
orescein Family Dves 

[0091] The C-terminal amino add portion of Taq DNA polymerase encodes the polymerase active site domain (Law- 
yer et al., 1993, ECB Methods and A pplications 2:275-287. Freemont et al., 1986. Proteins: Structure. Function aod 
Genetics 1:66-73, which are incorporated herein by reference). A DNA fragment containing this region was isolated 
from the full-length Taq gene and mutagenized by PGR anrplif ication in the presence of manganese (Leung et al, 1 989, 
Technique 1(1): 11-15). For this example, ail restriction enzymes were purchased from New England Biolabs, Beverly 
MA. The mutagenized fragments were digested with Pst\ and BglW and doned Into a Taq expression plasmid. pLK102. 
which had been digested with Fs(l and BglW. Plasrrod pLK102 is a derivative of pLKIOI in which the 900 bp Pstl-BglW 
fragment is replaced by a short P$t\'Bgl\\ Unto. Plasmid pLKIOI is a modified iom of pSYC1578 (Lawyer ef a/.. 1993, 
supra and U.S. Patent No. 5.079.352 ), in which the small HincWIEcom fragment located 3' to the polymerase coding 
region was deleted. 

[0092] The resulting expression plasmids were transformed into £ coli strain N1624 (available from the E, coll 
Genetic Stock Center at Vfeile University, strain No. CGSC #5066) and the resulting transformants were screened for the 
ability to more efficiently incorporate ja-^^PJTet-dCTP in conparison to the wild-type enzyme. Using this procedure 
Mutant CS1, was identified as having the ability to more efficiently incorpaate [a-^PJTet-dCTP. The mutagenized Taq 
expression plasmid of mutant CS1 was digested with Hin6\\yNhe\ and the resulting restriction fragment was subdoned 
into the wild-type gene of pLKIOI, replacing the unnurtagenized HindlWNhel fragment, to determine which portion of 
the mutagenized Taq polymerase gene was responsittle for the altered phenotype. Subclones containing the 
H/ndllll/A//»l restriction fragment conferred the altered phenotype on the wild-type enzyme, indicating that the mutation 
was within this fragment Subsequent subclone analysis determined that the mutation was located in the 265 bp 
BamH\'Nhe\ fragment. 

{0093] DNA sequence analysis of the 265 Nhe\'BamH\ fragment was performed on pCSI using the TaqFS DyeDe- 
oxy™ Terminator Cyde Sequendng Ktt from Applied Biosystems. Foster City. CA. and the Applied Biosystems Model 
Prism 377 DNA Sequencing System. The sequence analysis identified a missense mutation in the Taq polymerase 
gene at amino add position 681 . that caused a Glutamic acid (E) residue to be replaced by a Lysine (K) residue. Num- 
bering is initiated at the codon encoding the first methionine residue of the nature protein, as in U.S. Patent No. 
5,079,352. which is herein incorporated by reference. This mutation, E681 K. specifically was caused by a GAG to AAG 
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change in the ccxion sequence. Plasmid pCSI was deposited with the ATCC on August 28, 1997, and given accession 
No. 98521. 

[0094] Plasmid pCSl may contain additional nrwtations in the coding sequence for Taq polymerase; however, by fur- 
ther subdoning experiments, the E681 K mutation was determined to be solely responsible for the increased efficiency 
in incorporation of nucleoside triphosphates labeled with fluorescein dyes. This point mutation is located in the 265 
base pair BamHUNhel DNA fragment shown in Figure 1. Within the 265 bp DNA fragment, the E681K mutation is the 
only change from the wild-type Taq polymerase gene sequence. 

[0095] For further analysis and quantitation of the efficiency of incorporation of nucleotide analogues, the 265 bp 
BamH\-Nhe\ fragment of plasmid pCSl was cloned into a Taq expression vector that contained the wild-type sequence 
within the polymerase domain. pRDA3-2. Plasmid pRDA3-2 referred to as clone 3-2. is fully desaibed in PCT Patent 
Publication No. WO 92/06200. which is incorporated herein by reference. A second clone encoding both the E681K 
nuitation as well as a F667Y mutation was created by primer-directed mutagenesis and subsequent cloning of a PCB 
product containing both mutations into the BamHUNhel sites of plasmid pRDA3-2. 

[0096] Expression vector pRDA3-2 contains the full-length Taq DNA polymerase gene operaWy linked to the phage 
lambda Pl pronxrter. In vector pRDA3-2. the 5*-nuclease domain of the Taq DNA polymerase gene contains a point 
mutation at the codon encoding glycine at position 46 that reduces 5*-nuclease activity (G46D mutation). However, the 
gene sequence within the polymerase domain of the expression vector pRDA3-2 is identical to the wild-type Taq DNA 
polymerase gene sequence. Plasmids. pRDA3-2, pCSI and the E681 K F667Y PGR product were digested with BamHI 
and Nhe\ and the 265 bp DNA fragment from plasmid pCSI or the PGR product was ligated into vector pRDA3-2 by 
conventional means. The resulting plasmids, pLK112 and pLK113. respectively, were transformed into £ co// strain 
DG1 16 (ATCG No. 53606). These plasmids encode thermostable DNA polymerases herein referred to as G46D E681 K 
Taq and G46D E681K F667Y Taq, respectively. The expressed thermostable DNA polymerase protein G46D E681K 
F667Y Taq was purified according to the method described by Lawyer et al., 1993. supra^ 

10097] The G46D E681 K Taq enzyme was purified using a similar, but smaller scale preparation method as follows: 
All steps were preformed at 4" C unless indicated othenwise. Gells from a 475 ml culture were resuspended in 30 ml of 
buffer (50 mM Tris-HCI. pH 7.5. 10 mM EDTA, pH 8.0, 0.5 mM Pefabloc^SC. 0.5 jig/m! leupeptin. 0.1 mM Na-p-tosyl- 
L-Lysine Chloromethyl Ketone, 1 mM DTT). Gells were sonicated at 50% duty cycle, setting 5 for 1 minute, and cooled 
on ice for 1 minute. This step was repeated twice more. Then 1.5 ml of 4.0 M amnrionium sulfate was added and the 
mixture heated in a 75*G water bath for 15 minutes, followed by cooling on ice. Polyethyleneimine was added to 0.6% 
and the mixture was inculcated on ice for 10 minutes. The mixture was centrifuged at 16,000xg for 30 minutes. The 
supernatant was loaded on a 1.8 ml volume phenyl-sepharose column (Bio-rad Polyprep chromatography column) 
equilibrated with a solution of 50 mM Tris-HQ, pH 7.5, 10 mM EDTA. pH 8.0. 1 mM DTT, 0.2 M (NH4)2S04. The column 
was washed with 6 ml each of three solutions: 1) 25 mM Tris-HGI, pH 7.5. 1 mM EDTA. 1 mM DTT, 0.2 M (NH4)2S04. 
2) 25 mM Tris-HQ. pH 7,5. 1 mM EDTA, 1 mM DTT, and 3) 25 mM Tris-HGI. pH 7.5, 1 mM EDTA. 1 mM DTT, 20 % 
ethylene glycol. The polymerase was eluted with 6 ml of 25 mM Tris-HQ. pH 7.5, 1 mM EDTA. 1 mM DTT. 20% ethylene 
glycol. 2.5 M urea. After adjusting the polymerase preparation to 100 mM KG! with 3M KGI, the mixture was loaded on 
a heparin-sepharose column (1 .8 ml volume, Bio-rad Poly-prep column) equilibrated in 25 mM Tris-HGI, pH 7.5, 1 mM 
EDTA. 1 mM DTT. 100 mM KGI. After a wash with the same buffer, the sample was eluted in a buffer of 25 mM Tris-HGI, 
pH 7.5. 1 mM EDTA. 1 mM DTT, 400 mM KCI. 

[0098] Following purification, the activity of the modified enzymes was determined by the activity assay described in 
Lawyer et al,, 1989. iL EioL Chem. 264:6427-6437. which is incorporated herein by reference. The activity of the puri- 
fied enzymes was calculated as follows: one unit of enzyme con-esponds to 10 nmoles of product synthesized in 30 min. 
DNA polymerase activity is linearly proportional to enzyme concentration up to 80-100 pmoles dGMP incorporated 
(diluted enzyme at 0.024-0.03 units/jil). The purified enzymes were utilized in the incorporation and sequencing reac- 
tions desaibed in Examples ll-IV. 

Example II 

Assay to Compare Eff iciencv of Incorporation of ddNTPs 

[0099] The relative abilities of G46D F667Y Taq, G46D F667Y E681 K Taq and F730Y rma30 DNA polymerases to 
incorporate a fluorescen dye family-labeled ddGTP were compared by use of a limiting template, primer extension 
conpetition assay. F730Y rma30 DNA polymerase is described in Example I of U.S. Serial No 60/052065, f fled July 9. 
1997 and in European Patent Application No. 981 12327.6, both herein incorporated by reference. In this competition 
assay, because the incorporation of a ddGTP terminates the extension reaction, the more readily the polymerase incor- 
porates a ddGTP into an extended primer, the less [a-^P]dGTP can be incorporated. Thus, as the efficiency of ddGTP 
incorporation increases, the extent of inhibition of DNA synthesis is inaeased. The efficiency of incorporation of ddGTP 
is then conpared to the efficiency of incorporation of f luorescently labeled ddGTP to give a relative measurement of the 
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efficiency of incorporation of fluorescently-labeled ddNTPs for a given enzyme. 

[01 00] The assay was performed as previously described (Lawyer et aL . 1 989. iL BioL Chem. 264 :6427) including the 
foilcwing nrxxJifications. The assay mixture was composed so the final concentration was 50 mM Bicine pH 8.3, 25' C. 
2.5 mM MgCl2. 1 mM p-mercaptoethanol. 20 nM each of dATP, dGTP and dTTP (Perkin-Elmer). 20 \iM dCTP (PerWn- 
Elmer) and [a-^PJdCTP (New England Nuclear. Boston. MA). M13mp18 (Perkin-Elmer) was annealed to primer DG48. 
(SEQ ID NO: 10) and the equivalent of 0.085 pmoles of the annealed template was added to the assay mixture for each 
reaction. Thirty-five ^il of the assay mixture with template DNA was added to each of 380.5 ml eppendorf tubes. Dilu- 
tions of Zowie-ddCTP in 25 mM CAPSO buffer, pH 9.6 were prepared such that when 10 ^1 of each was added to the 
reaction tube, the final concentration of Zowie-ddCTP would be 3. 1. 0.5, 0.25. 0.125, or 0.0625 jiM. For G46D F667Y 
Taq DNA polymerase, two tubes each of the 3, 1 . 0.5, 0.25. 0.125 ^M Zowie-ddCTP were prepared. For G46D F667Y 
E681K Taq and F730Y fmaSO DNA polymerases, two tubes each of the 1. 0.5. 0.25. 0.125. and 0.0625 fiM Zowie- 
ddCTP were prepared. The eight remaining reaction tubes received 10 of 25 mM CAPSO buffer, pH 9.6. Thus, each 
of the thirty-eight tubes contained 35 \i\ of assay mix and 10 fil of either 25 mM CAPSO buffer. pH 9.6 or one of the 
Zowie-ddCTP dilutions. 

[0101] For each enzyme to be tested, polymerization was initiated in one tube of each Zowie-ddCTP dilution and two 
tubes containing the CAPSO buffer alone using 5 \i\ of the enzyme. The following concentrations of the enzymes were 
used, each predetermined to be an excess amount of enzyme for the amount of substrate in the assay: 2.5 units of 
F667Y G46D Taq DNA polymerase prepared as in Example I; 1 .25 units of G46D. F667Y. E681 K Taq DNA polymerase, 
prepared as in Example I; or 2 units of F730Y T/naSO DNA polymerase. As a control for the level of background, the 
remaining negative control was initiated with enzyme dilution buffer rather than polymerase. All reaction tubes were 
immediately vortexed briefly and incubated for 10 minutes at 75' C. The reactions were stopped by addition of 10 ^1 60 
mM EDTA and stored at 0* C. 

[01 02] In an analogous experiment, ddCTP was diluted in 25 mM CAPSO buffer, pH 9.6 such that when 1 0 jil of each 
dilution was added to the reaction tubes, the final concentration would be 0.5, 0.25, 0.125, 0.0625. or 0.0312 fiM. Ten 
^1 of each dilution was pipetted into each of three 0.5 ml Eppendorf tubes containing 35 |il of the assay mixture as 
described above. Four tubes containing 35 jil of the assay mix plus 10 |xl of 25 mM CAPSO buffer. pH 9.6 were also 
prepared. Thus, each of the 19 tubes contained 35 \i\ of assay mix and 10 jil each of either 25 mM CAPSO, pH 9.6 or 
one of the ddCTP dilutions. 

[0103] Polymerization was initiated in one tube of each ddCTP dilution and one tube of CAPSO buffer with 2.5 units 
of G46D F667Y Taq DNA polymerase, 1.25 units of G46D F667Y E681K Taq DNA polymerase or 2 units of F730Y 
TmaZO DNA polymerase. The remaining tube containing CAPSO was initiated with enzyme dilution buffer rather than 
the polymerase-containing buffer as a negative control. All reactions were immediately vortexed and incubated 10 min- 
utes at 75** C. The reactions were stopped by addition of 10 miaoliters of 60 mM EDTA and stored at 0* C. 
[0104] For each reaction, a 50 ^il aliquot of the 60 |il reaction was diluted with 1 ml 2 mM EDTA. 50 fig/ml sheared 
salmon sperm DNA as a carrier. The DNA was precipitated with TCA using standard procedures and collected on GF/C 
fitter discs (Whatman). The amount of incorporated [a-^^PJdCMP was determined for each sample and normalized to 
the CAPSO samples without ddNTP (0% inhibition). The concenUation of ddCTP or Zowie-ddCTP needed for 50% inhi- 
bition was calculated for each sample and is shown in Table 2. Comparison of the amount of ddCTP needed to inhibit 
synthesis 50% with the amount of Zowie<JdCTP required to inhibit synthesis by 50% for a particular enzyme reflects 
the relative abifity of each enzyme to incorporate fluorescently-labeled analog. These data show that G46D F667Y Taq 
DNA polymerase incorporates Zowie-ddCTP least efficiently of the three enzymes tested (ratio of concentrations for 
50% inhibition by Zowie^ldCTP vs. ddCTP = 25). F730Y TmaZO DNA polymerase incorporates this labeled analog 
more efficiently than G46D F667Y Taq DNA polymerase (ratio of concentrations for 50% inhibition by ZowieKldCTP vs. 
ddCTP = 4), while G46D F667Y E681K Taq DNA polymerase incorporates labeled and unlabeled ddCTP with nearly 
equal efficiency (ratio of concentrations for 50% inhibition by Zowie-ddCTP vs. ddCTP=1 .2). 



Table 2 



Concentration (jiM) of Zowie-ddCTP or ddCTP needed for 50% inhibition 


DNA polymerase 


Zowie-ddCTP 


ddCTP 


Zowie<JdCTP/ddCTP 


G46D F667Y Taq 


1.4 


0.056 


25 


G46D F667Y E681 K TaQ 


0.14 


0.116 


1.2 


F730Y rma30 


0.236 


0.057 


4 
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Example III 
Extension Rate Assay 

5 [0105] The extension rate of G46D F667Y Tag and G46D F667Y E681K Taq were determined using an extension 
rate assay. In this experiment, the enzymes were used to extend a primer annealed to an M 1 3 template in the presence 
of [a-^P^lCTP. The extension reactions were denatured and the products analyzed by denaturing agarose gel electro- 
phoresis. 

[01 06] The assay was performed as previously described (Uwyer et aL, 1 989. i SioL Chem. 264 :6427) including the 
10 following nrxxjif ications. The assay mixture was composed so the final concentration was 50 mM Biclne pH 8.3, 25* C. 
2.5 mM MgCl2. 1 mM p-mercaptoethanol, 200 jiM each of dATP. dGTP and dTTP (Perkin-Elmer). 100 fiM dCTP (Per- 
kin-Elmer) containing [a-^P]dCTP (New England Nuclear. Boston, MA). M13mpl8 (Perkin-Elmer) was annealed to 
primer DG48, (SEQ ID NO: 1 1), and the equivalent of 0.085 pnrwies of the annealed template was added to the assay 
mixture for each reaction. Forty-five ^il of the assay mixture with template DNA was added to each of fourteen 0.5 ml 
IS eppendorf tubes. Each tttoe was preincubated at 75** C for at least 30 seconds before the start of the polymerase reac- 
tion. 

[0107] Polymerization was initiated in six of the fourteen assay tubes with 5 ^1 of G46D F667Y Taq DNA polymerase 
(2.5 units) or G46D F667Y E681 K Taq DNA polymerase (1 .25 units). Both enzymes were prepared as in Example I and 
the concentration used represents a predetermined excess amount of enzyme for the anrwunt of substrate in the assay. 

20 As a control for the level of background, the remaining negative control was initiated with enzyme dilution buffer rather 
than polymerase. All reaction tubes were immediately vortexed briefly and incubated at 75* C. Two of the six tubes con- 
taining G46D F667Y Taq DNA polymerase were incubated 3 minutes, two for 6 minutes and two for 10 minutes. Simi- 
larly, two of the tubes started with G46D F667Y E681 K Taq DNA polymerase were incubated for 30 seconds, two for 1 
minute and two for 2 minutes. The control tubes were incubated for 3 minutes. The reactions were stopped by addition 

25 Of 10 \i\ 60 mM EDTA and stored at 0** C. 

[0108] For each reaction, a 25 aliquot of the 60 nl reaction was diluted with 1 ml 2 mM EDTA. 50 jig/ml sheared 
salmon sperm DNA as a carrier. The DNA was precipitated with TCA using standard procedures and collected on GF/C 
filter discs (Whatman). The amount of incorporated [a-^^P]dCMP was determined for each sample. 
[0109] The remaining 35 ^il of each duplicated were combined and the 70 \i\ sample was ethanol precipitated, dried 

30 and resuspended in 50 mM NaOH, 1 mM EDTA. Aliquots were removed from these samples such that an equal number 
of [a-^^P] counts were taken from each. These aliquots were loaded on an 0.9% alkaline agarose gel. electrophoresed, 
dried and autoradiographed as previously described (Maniatis et aL, 1982, In Molecular Cloning: A Laboratory Manual. 
Cold Spring Hartwr Laboratory. CoW sipring Harbor. NY). Bacteriophage lambda DNA cut with restriction enzyme 
Hin6\\\ (BRL) and 5* end-labeled with p^P] was used as a molecular weight standard. 

35 [01 1 0] The length in base pairs of the extension product in each sample was determined by comparison of the migra- 
tion distance of each sample with the distance migrated by the lambda DNA size standard. The number of base pairs 
In each product was divided by the number of seconds each extension reaction incubated to give the extension rate as 
shown t^elow. 

40 



DNA polymerase 


Time 


Base Pairs/Sec 


G46D F667Y Taq 


3 min. 


12.5 


G46D F667Y Taq 


6min. 


12.2 


G46D F667Y Taq 


10 min. 


11.8 


G46D F667Y E681K Taq 


30 sec. 


36.7 


G46D F667Y E681K Taq 


1 min. 


41.7 


G46D F667YE681K Taq 


2 min 


52.9 



(0111] These results indicate that the presence of the E681 K mutation increases the extension rate of a G46D F667Y 
55 enzyme by 3- to 4.3- fold. 
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Ex^rpplg IV 

Cvde SeQuencino wrth G46D F667Y E68 1K Taa DNA Poivmerase and Fluorescein Ubeled cJdNTPs 

5 [0112] This example demonstrates the application of the modified polymerase of the invention to fluorescein dye 
labeled dideoxyterminata cycle sequencing, utilizing 1 jiMor lessddNTP andaratioof ddNTPidNTP of at least 1:100. 
The fluorescein dye labeled dideoxy terminators are reagents from the Applied Biosystems PRISM Sequenase<» Ter- 
minator Sequencing Kits (Perkin-Elmer. Nonwalk. CT) and were optimized for use with Sequenase DNA polymerase 
and alpha-thio dNTPs. Cyde sequencing reactions were performed in a 20 jil volume containing 50 mM Tris-HCI (pH 

TO 8.8). 2.0 mM MgCl2. 100 \ihA each dATP. dCTP, and dTTP (Perkin-Elmer. Nonwalk, CT). 500 |iM dITP (Pharmada Bio- 
tech. Piscataway. NJ). 0.2 ng M13mp18 single-strand DNA template (PerWn-Elmer), 0.15 jiM LacZ Fonward Primer 
(Perkin-Elmer). 5 units of G46D F667Y E681 K Taq DNA polymerase. 20 units of rTth Thermostable Pyrophosphatase 
(European Patent Application. Publication No. EP-A-763 599 and U.S. Patent No. 5.665,551). 0,05 ^M Sequenase A 
Dye Terminator. 0.80 ^iM Sequenase C Dye Terminator, 0.08 ^M Sequenase G Dye Terminator, and 1 .0 ^M Sequenase 

15 T Dye Terminator. All four Sequenase Dye Terminatas were purchased from Perkin-Elmer. Reactions were placed in a 
preheated (75" C) Perkin-Elmer GeneAmp® PGR System 9600 thermal cyder and subjected to 25 cycles of 96" C for 
10 seconds, 50" C for 5 seconds, and 60" C for 4 minutes. Dye labeled fragments were purified with Centri-Sep™ col- 
umns (Princeton Separations, Adelphia. NJ) following the nr^nufacturer's instructions and dried in a vacuum centrifuge. 
Pellets were resuspended in 6 \i\ of deionized formamide:50 mg/mL Blue dextran (in 25 mM EDTA, pH 8.0) 5:1 (v/v), 

20 heated at 90" C for 3 minutes, and directly loaded onto a pre-electrophoresed 4% polyaaylamide/6 M urea gel and 
electrophoresed and analyzed on a Perkin-Elmer ABI PRISM' 377 DNA Sequencer according to the nr^nufacturer 
instructions (ABI PRISM 377 DNA Sequencer User's Manual). Automated base-calling by the Perkin-Elmer ABI PRISM 
377 DNA Sequencer analysts software resulted in greater than 98.5% accuracy for 450 bases (6 errors for bases +10 
to +460 from primer). 

25 
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SEQUENCE LISTING 



1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) NAME: F . Hoffmann -La Roche Ltd 

(B) STREET: Grenzacherstrasse 124 

(C) CITY: Basel 

(D) STATE: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): CH-40"70 

(G) TELEPHONE: i0)61 688 24 03 

(H) TELEFAX: (0)61 688 13 95 

(I) TELEX: 962292/965512 hlr ch 

(ii) TITLE OF INVENTION: Altered Thermostable DNA Polymera: 

for Sequencing 



(iii) NUMBER OF SEQUENCES: 18 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/052.065 

(B) FILING DATE: 09 -JUL- 19 9 7 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 7 is Val or lie." 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Leu Ser Xaa Xaa Leu Xaa Xaa Pro Xaa Xaa Glu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 3..€ 

(D) OTHER INFORMATION: /lab€l= Xaa 
/note= 'Xaa at position 3 is Gin or Gly. Xaa at position 6 is 
Ser or Ala. • 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Leu Ser Xaa Xaa Leu Xaa lie Pro Tyr Glu Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Leu Ser Gin Xaa Leu Ala He Pro Tyr Glu Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 
^ (B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 7 is Val or lie." 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

^ Leu Ser Val Xaa Leu Gly Xaa Pro Val Lys Glu 

1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
SO (A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Ser Val Xaa Leu Gly Val Pro Val Lys Glu 
1 5 10 

10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

,5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Leu Ser Val Xaa Leu Gly He Pro Val Lys Glu 
1 5 10 

25 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Region 
55 (B) LOCATION: 4. .7 

(D) CXTHER INFORMATION: /labels Xaa 
/note= -Xaa at position 4 is any amino acid but not a glutamic 
acid residue. Xaa at position 7 is Val or He." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Leu Ser Xaa Xaa Leu Xaa Xaa Pro Xaa Xaa Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

SO (ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 



55 
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(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORWATION: /label= Xaa 
/noce= "Xaa at position 4 is any amino acid except Glu." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 2.. 5 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 2 is Ser or Ala, Xaa at Position 4 is 
any amino acid except Glu and Xaa at Position 5 is Leu or He. 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at positioin 4 is any amino acid except Glu." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Leu Ser Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGGAAGGGCG ATCGGTGCGG GCCTCTTCGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Arg Arg Xaa Xaa Lys Xaa Xaa Asn Tyr Xaa Xaa Xaa Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(CI STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 3. .6 

(D) OTHER INFORMATION: /label= Xaa • • ^ 

/note= -xaa at position 3 is Gin or Gly. Xaa at J:^^^^^ is 
any amino acid except Glu. Xaa at position 6 is Ser or Ala. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Leu Ser Xaa Xaa Leu Xaa lie Pro Tyr Glu Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
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10 



15 



30 



35 



40 



SO 



(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Xaa 
/note= -Xaa at position 4 is any amino acid except Glu. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Ser Gin Xaa Leu Ala He Pro Tyr Glu Glu 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 
20 (A) NAME/KEY: Region 

(B) LOCATION: 4. .7 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 4 is any amino acid except Glu. Xaa at 
position 7 is Val of He." 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Leu Ser Val Xaa Leu Gly Xaa Pro Val Lys Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 4 is any amino acid except Glu." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

^ Leu Ser Val Xaa Leu Gly Val Pro Val Lys Glu 

15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 4 is any amino acid except Glu." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Leu Ser Val Xaa Leu Gly He Pro Val Lys Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE; 

(A) NAME/KEY: Region 

(B) LOCATION: 4.. 7 

(D) OTHER INFORMATION: /label= Xaa 
/note= "Xaa at position 4 is any amino acid except Glu. Xaa at 
position 7 is Val or He." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ser Gin He Xaa Leu Arg Xaa 
1 5 



Claims 

1 . A recombinant thermostable DNA polymerase which is characterized in that 

a) in its native form said polymerase comprises the amino acid sequence LeuSerXaaXaaLeuXaaXaaProXaaX- 
aaGlu (SEQ ID NO: 1). whereby "Xaa" at positions 3. 4. 6, 9. and 10 oi said sequence are any amino acid res- 
idue, and "Xaa" at position 7 of said sequence is Val or lie; 

b) said "Xaa" at position 4 is mutated in comparison to said native sequence, except that "Xaa" at position 4 is 
not mutated to Gu: and 

c) said thernfx>stable DNA polymerase has a level cf discrimination against inoorpaation of nucleotides labeled 
with fluorescein family dyes which is reduced in comparison to the native form of said polymerase. 

2. The recombinant thernrrostable DNA polymerase of daim 1 wherein said nucleotide is a dideoxynucleotide and said 
level of discrimination is at least 3-fold lower than that of said native form of said polymerase. 

3. The recombinant thermostable DNA polymerase of claim 2 wherein said level of discrimination is measured by 
determining the concentration of a dideoxynucleotide labeled with a fluorescein dye that is required for 50% inhlbi- 
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tion of DNA synthesis 

4. The recombinant thermostable DNA polymerase of claim 2 wherein said polymerase is from a thermophilic species 
selected from the group consisting of Thermosipho africanus. Bacillus caldotenax, and Bacillus 
stearothermophilus. 

5. The recombinant thermostable DNA polymerase of claim 2 wherein said polymerase is from a Thermus species. 

6. The recont)inant thermostable DNA polymerase of claim 5 which is characterized in that 

a) in its native form said polymerase comprises the amino acid sequence LeuSerXaaXaaLeuXaalleProTyr- 
GluGlu (SEQ ID NO: 2). whereby "Xaa" at position 3 is Gin or Gly, "Xaa" at position 4 is any amino acid, and 
"Xaa" at position 6 is Ser or Ala 

b) said "Xaa" at position 4 is mutated in comparison to said native sequence, except that "Xaa" at position 4 is 
not mutated to Glu; and 

c) said thernnostable DNA polymerase has a level of discrimination against incorporation of nucleotides labeled 
with fluorescein family dyes which is reduced in comparison to the native form of said polymerase, 

7. The recombinant thermostable DNA polymerase of claim 6 which Is characterized in that 

a) in its native form said polymerase comprises the amino acid sequence LeuSerGlnXaaLeuAlalleProTyr- 
GluGlu (SEQ ID N0:3). whereby "Xaa" at position 4 is any amino acid 

b) said "Xaa" at position 4 is mutated in comparison to said native sequence, except that "Xaa" at position 4 is 
not nrwtated to Glu; and 

c) said thermostable DNA polymerase has a level of discrimination against incorporation of nucleotides labeled 
with fluorescein family dyes which is reduced in comparison to the native form of said polymerase. 

8. The recombinant thermostable DNA polymerase of claim 7 which is characterized in that said "Xaa" at position 4 
is mutated to Lys. 

9. The recombinant thermostable DNA polymerase of daim 2 wherein said polymerase is from a Thermotoga spe- 
cies. 

10. The recombinant thermostable DNA polymerase of claim 9 which is characterized in that 

a) in its native form said polymerase comprises the amino acid sequence LeuSerValXaaLeuGlyXaaProVal- 
LysGlu (SEQ ID NO: 4), whereby "Xaa" at position 4 is any amino acid, preferably Arg. and "Xaa" at position 7 
is Val or lie. 

b) said "Xaa" at position 4 is mutated in comparison to said native sequence, except that "Xaa" at position 4 is 
not nxitated to Gu; and 

c) said thermostable DNA polymerase has a level of disaimination against incorporation of nucleotides labeled 
with fluorescein family dyes which is reduced in comparison to the native form of said polymerase. 

11. A nucleic add sequence encoding a recombinant thermostable DNA polymerase as claimed in any one of daims 
1 to 10. 

12. A vector comprising a nucleic add sequence encoding a recombinant thermostable DNA polymerase as claimed 
in any one of daims 1 to 10. 

13. A host cell comprising a nudeic acid sequence encoding a recombinant thermostable DNA polymerase as claimed 
in any one of dain^ 1 to 10. 

14. A method for preparing a recombinant thermostable DNA polymerase as daimed in any one of daims 1 to 10. said 
method comprises: 

(a) culturing a host cell comprising a nudeic acid sequence encoding a recombinant thermostable DNA 
polymerase as daimed in any one of daims 1 to 10 under conditions which promote the expression of the 
recombinant thernrrastable DNA polymerase; and 
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(b) isolating the recombinant thermostable DNA polymerase from the host cell or from the medium, 

15. A recombinant thernrxDStable DNA polymerase prepared by the method as claimed in claim 14. 

16. A method of DNA sequencing which method comprises: 

a) providing a recombinant thermostable DNA polymerase as claimed in any one of claims 1 to 10; 

b) providing a dye-terminator labeled with a negatively charged fluorescent dye; and 

c) performing a dye-terminator sequendng reaction. 

17. The method of claim 16 wherein the recon]t>inant thernrwstable DNA polymerase has a reduced level of discrimi- 
nation against incorporation of nucleotides labeled with fluorescein family dyes and wherein the said nucleotide is 
a dideoxynucleotide and said level of disaimination is measured by determining the ratio of the concentration of a 
dideoxynudeotide labeled with a fluorescein dye required for 50% inhibition of DNA synthesis versus the concen- 
tration of an unlabeled dideoxynudeotide required for 50% InhibKion. 

18. The method of claim 17 wherein said ratio is 4 or less. 

19. A method of produdng labeled DNA which method comprises: 

a) providing a recombinant thermostable DNA polymerase as daimed in any one of claims 1 to 10; 

b) providing a nucleotide labeled with a fluorescein family dye; and 

c) performing a DNA synthesis reaction. 

20. A metiiod of produdng labeled primer extension products which method comprises: 

a) providing a recombinant thermostable DNA pdymerase as claimed in any one of claims 1 to 10; 

i) said polymerase comprises the amino acid sequence LeuSerValXaaLeuGlyXaaProValLysGlu (SEQ ID 
NO: 4), whereby "Xaa* at position 4 can be any amino add except Glu. and "Xaa" at position 7 of tiiis 
sequence is Val or lie, 

ii) said polymerase has a reduced level of discrimination against incorporation of nucleotides labeled with 
fluorescein family dyes; 

iii) said polymerase also comprises ttie second amino add sequence SQ1XLR(V/I) (SEQ ID NO: 18) where 
"X" is any amino acid except E, 

iv) said polymerase has reduced discrimination against incorporation of ribonucleotides labeled with fluo- 
rescein family dyes; 

b) providing a rtoonucleotide labeled with a fluorescein family dye. and 

c) performing a primer extension reaction. 

21. The use of a recombinant therniostable DNA polymerase as daimed in any one of claims 1 to 10 in a in vitro DNA 
synthesis application, such as DNA sequendng. syntiiesis of labeled DNA and the production of labeled primer 
extension products. 

22. A kit for DNA sequendng whuch comprises a thermostable DNA polymerase as daimed in any one of claims 1 to 
10 and a terminator labeled with negatively-charged fluorescent dye. 

23. The kit of daim 22 wherein the said recombinant thernriostable DNA polymerase has reduced discrimination 
against Incorporation of nudeotides labeled with fluorescein family dyes and wherein said reduced level of discrim* 
ination is measured by determining the ratio of the concentration of ddNTP labeled with a fluorescein family dye 
required for 50% inhibition of DNA synthesis compared to that for an unlabeled ddf^ and said ratio is 4 or less. 

24. The kit of daim 23 wherein said level of disaimination is at least 5-fold lower than that of said native form of said 
polymerase. 

25. A kit for producing labeled DNA which conprises a recombinant thernrrostable DNA polymerase as claimed in any 
one of claims 1 to 10 and a nucleotide labeled with a negatively-charged fluorescent dye. 
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26. A kit for prcxiucing labeled primer extension products which comprises a recombinant thermostable DNA polymer- 
ase as claimed in any one of claims 1 to 10 and a ribonucleotide labeled with a fluorescein family dye. 
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